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Description 

CHROMOSOME 1 GENE AND GENE PRODUCTS 
RELATED TO ALZHEIMER'S DISEASE 

5 - 

Technical Field 

The present invention relates generally to Alzheimer's disease, and more 
specifically to methods and compositions for use in diagnosis and treatment of 
Aizlieimer's Disease. 

10 

Background of the Invention 

Alzheimer's disease (AD) is a devastating, neurodegenerative 
progressive disorder first recognized in 1907 (Alzheimer, Algemeim Zeitschrift fur 
Psychiatric 6^:146-148, 1907). AD is a common disease in the elderly and is the 
1 5 predominant cause of dementia in people over 65 years of age. The prevalence of AD 
is estimated to be as high as 18,7% and 47.2% for the 75-84 year and > 85 year age 
groups respectively. Thus, there is a large affected population in most countries of the 
world. 

Clinical symptoms of the disease typically begin with subtle short term 

20 memory problems. As the disease progresses, difficulty with memor>', language, and 
orientation worsen to the point of interfering with the ability of the person to function 
independently. Other symptoms, which are variable, include myoclonus and seizures. 
Duration of AD from the first symptoms of memory loss until death is 10 years on 
average, but may range from 6-8 years to more than 20 years. AD always results in 

25 death, often from respiratory -related illness. 

The pathology in AD is confined exclusively to the central nervous 
system (CNS). The predominant features arc amyloid deposits (plaques) and 
neurofibrillary tangles (NFT). In AD, amyloid is found associated with the vascular 
system of the CNS and as focal deposits in the parenchyma. The major molecular 

30 component of amyloid is a highly hydrophobic peptide called the AP peptide. This 
peptide aggregates into filaments in an anti-j^plcated sheet structure resulting in the 
birefringent nature of the AD amyloid. While Ap is the major component of AD 
amyloid, a partial list of other proteins associated with the amyloid includes a-l-anti- 
chymotrypsin (Abraham, et al.. Cell 52:487-501, 1988), cathepsin D (Cataldo, et al., 

35 Brain Res, 573:181-192, 1990), non-amyloid component protein (Ueda. et al., Proc. 
Natl Acad ScL USA 90:\ 1282-1 1286, 1993), apolipoprotein E (apoE) (Namba, et al.. 
Brain Res. 5^1:163-166, 1991; Wisniewski and Frangione, Neurosci. Lett 
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/ii:235-238, 1992; Slrittmatter, et al.. Proc. Nat. Acad Sci USA 9^*1977-1981, 1993), 
apolioprolein J (Choi*Mura, et al.. Acta Neuropathol. ^5:260-264, 1992: McGeer, et al.. 
Brain Res. 579:337-341, 1992), heat shock protein 70 (Hamos, cl a!.. Neurology 
^7:345-350, 1991). complement components (McGeer and Rogers, Neurology 
5 ^i:447-449, 1992), aj.macroglobin (Strauss, cl al., Lak Invest. 66:223-230, 1992), 
interleukin-6 (Strauss, et al.. Lab. Invest. 6(5:223-230, 1992), proteoglycans (Snow, et 
aL, Lab. Invest. i^:454-458, 1987), and scrum amyloid P (Coria, cl aL, I^b. Invest. 
5^:454-458^ 1988). Surrounding many plaques are dystrophic neurites, which are nerve 
endings containing abnormal filamentous structures. Plaques are often surrounded by 
10 astrocytes and activated microglial cells expressing immune-related proteins, such as 
the MHC class 11 glycoproteins HLA-DR, HLA-DP, and HLA-DQ as well as MHC 
class I glycoproteins, intcrleukin-2 (IL-2) receptors, and IL-1. The other dominant 
feature of AD neuropathology is the presence of NFTs. These consist of abnormal 
filaments bundled together in neuronal cell bodies. "Ghost" NFTs are also observed in 
15 AD brains, which presumably mark the location of dead neurons. Other 
neuropalhological features include granulovacuolar changes, neuronal loss, gliosis and 
the variable presence of Lewy bodies. 

In the AD brain, the destructive process of the disease is evident on a 
gross level. In the late-stage of AD, ventricular enlargement and shrinkage of the brain 
20 can be observed by magnetic resonance imaging. On autopsy, extensive gliosis and 
neuronal loss are observed. Thus, the amyloid plaque structures and NFTs observed al 
autopsy are most likely the end-points of a lengthy disease process, far removed from 
the initiating events of AD. Also, the cells remaining at autopsy are grossly different 
from tliose of a normal brain. Neurons, which were possibly involved in initiating 
25 events, arc absent and other cell types, such as the activated microglial cells and 
astroctyes, have gene expression patterns not observed in the normal brain. Thus, 
attempts using biochemical methods to identify key proteins and genes in the initialing 
steps of the disease are hampered by the fact thai il is not possible to actually observe 
these critical initiating events. Rather, biochemical dissection of the AD brain at 
30 autopsy is akin to molecular archeology, attempting to reconstruct the pathogenic 
pathway by comparing the normal brain to the end-stage disease brain. 

Substantial evidence has suggested that inherited genetic defects are 
involved in AD. Numerous early-onset kindreds have been described (Bird, el aL, Ann. 
NeuroL 2J:25-3I, 1988; Bird, et al., Ann. NcuroL 25:12-25, 1989: Cook, ct al, 
35 Neurology 29:1402-1412, 1979; Feldman, et aL, Neurology^ /i:81l-824, 1960; 
Goudsmit, / Neurol. Sci. 49:19-. 1981; Heston and White, Behavior Genet. ^:3 15-33 1, 
1978; Martin, et al.. Neurology^ ^/:62-68, 1991; Nee, et al., Arch. NeuroL ¥0:203-208, 
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1983; van Bogaeert, et ai., Mschr Psychait. Neurol, 102:249-301. 1940; Wheelan, Ann. 
Hum. Genet. 25:300-309, 1959). (Early-onset is defined herein as onset prior to 65 
years.) Families with multiple late-onset AD cases have also been described (Bird, et 
al., Ann. NcuroL 25:12-25. 1989; Heston and White, Behavior Genet. 5:315-331, 1978; 
5 Pericak- Vance, el aL, Exp. Neurol. 10221 1-21% 1988). In addition, twin studies have 
documented that monozygotic twins are more concordant in their AD phenolype than 
dizygotic twins (Nee, et al., Neurology 37:359-363, 1987; (133} ). Also, the families of 
concordant twins have more secondary cases of AD than families of discordant twins 
(Rapoport, ct al. Neurology 4I:\ 549- 1 553 J 99 1 ). 

10 Genetic dissection of AD has been complicated by the complexity of the 

disease and the limited accuracy of its diagnosis. Because AD is common in the 
elderly, clustering of cases in a family may occur by chance, representing possible 
confounding non-allelic genetic heterogeneity, or etiologic heterogeneity with genetic 
and non-genetic cases co-existing in the same kindred. In addition, the clinical 

1 5 diagnosis of AD is confoimded with other dementing diseases common in the elderly. 

Despite the problems associated with resolving complex genetic 
diseases, 2 causative AD loci and 1 risk -modifying gene have been identified. 
Mutations in the amyloid precursor protein (APP) gene on chromosome 21 cause early- 
onset (< 65 years) autosomal dominant AD (Goate, et al.. Nature 349:704, 1991). 

20 Mutations in a recently identified gene (AD3) on chromosome 14 also result in early- 
onset autosomal dominant AD (Schellenberg, et al.. Science 25H:66Z, 1992; 
Sherrington, et al.. Nature J7J:754-760, 1995). For late-onset AD, the APOE gene has 
been identified as a genetic modifying factor (Strittmatter, et al.. Proc. Natl. Acad Set. 
USA 90:1977, 1993; Corder, et aL. Science 2(5/:92K 1993; Cordcr, et al., Nat. Genet. 

25 7:180-184, 1994; Benjamin, et ai.. Lancet 344:473. 1994; Smith, ct al.. Lancet 
344:473-474. 1994). 

The known genetic loci for AD do not account for all cases of AD. For 
example, in late-onset AD approximately half of AD cases do not have the APOE e4 
allele (Brousseau, et al.. Neurology 342, 1994; Kuusisto, et al.. Brit. Med. 1 309:636. 

30 1994; Tsai, et al.. Am. J. Hum. Genet. 54: 643, 1994; Liddel, et al.. J. Med. Genet. 
J/; 197. 1994). Also, in the Volga German (VG) kindreds (Cook, et al.. Neurology^ 
29:1402, 1979; Biri et al., Ann. Neurol. 23:25. 1988; Bird, el al., Ann. Neurol. 25:12. 
1989; Bird, Am. Hist. Soc. Germ. Russia J. ^9:1991; Bird, ei al., in Heterogeneity of 
Alzheimer^s Disease, F. Boiler, et al., Eds. (Spring-Vcrlag, Heidelberg, 1992) pp. 

35 118-129), as in several other families with high incidence of AD, the known AD loci 
have been excluded as possible causes (Schellenberg, et al.. Science 255:668, 1992; 
Lannfell, et al., Nat. Genet. 4:218-219. 1993; van Duijn, et aJ., Am. J. Hum. Genet. 
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JJ:714-727, 1994; Schellenberg, et al.. Science 24iASQl, 1988; Schellenberg, et al.. 
Am. J Hwn, Genet 48:S6X 1991; Schellenberg, et al.. Am, J Hum. Genet. 49:S\\-S\1, 
1991; Kamino, et al., Am, 1 Hum. GeneL J7:998, 1992; Schellenberg, el al.. Am J. 
Hum. Genet. Ji:619, 1993; [13]; Schellenberg, et zUAnn. Neurol 57:223, 1992; Yu, et 
5 ah. Am. J. Hum. Genet. 5^:631, 1994). Identification of new genes should add 
considerably to the unfolding of the genetic determinants and enable biochemical and 
genetic approaches to diagnosis and treatment. 

The present invention provides a novel, previously unidentified locus for 
AD, methods and compositions for diagnosis and treatment of AD, and further provides 
1 0 other, related advantages. 



Summary of the Invention 

Briefly slated, the present invention provides isolated nucleic acid 
molecules encoding an AD4 (also known as STM2) gene product. Within one 

15 embodiment, a representative nucleic acid molecule is provided in Figures lA and B 
(hereinafter referred to as Figure 1 ). Within other embodiments, nucleic acid molecules 
are provided which encode a mutant AD4 gene product that increases the probability of 
Alzheimer's Disease (in a statistically significant manner). One represenUiive 
illustration of such a mutant is an amino acid subsitution at residue 141, wherein, for 

20 example, an isoleucine may be substituted for an asparagine. 

Within other aspects of the present invention, isolated nucleic acid 
molecules are provided, selected from the group consisting of (a) an isolated nucleic 
acid molecule as set forth in Figure I or complementary sequence thereof, (b) an 
isolated nucleic acid molecule that specifically hybridizes to the nucleic acid molecule 

25 of (a) under condilions of high stringency, and (c) an isolated nucleic acid that encodes 
an AD4 gene product. As utilized herein, it should be understood that a nucleic acid 
molecule hybridizes "specifically" to an AD4 gene (or related sequence) if it hybridizes 
detectably to such a sequence, but does not significantly or deteciably hybridize to the 
AD3 gene sequence under the same conditions. Within other aspects, isolated DNA 

30 molecules are provided that contain genomic sequences, such as the sequences 
presented in Figures 13-19. 

Within other aspects, expression vectors are provided comprising a 
f>romoter operably linked to one of the nucleic acid molecule described above. 
Representative examples of suitable promoters include tissue-specific promoters, as 

35 well as promoters such as the CMV I-E promoter, SV40 early promoter and MuLV 
LTR. Within related aspects, viral vectors are provided that are capable of directing the 
expression of a nucleic acid molecule as described above. Representative examples of 
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such viral vectors include herpes simplex viral vectors, adenoviral vectors, adenovirus- 
associated viral vectors and retroviral vectors. Also provided are host cells (e.^.. 
human, dog, monkey, rat or mouse cells) which carry the above-described vectors. 

Within other aspects of the present invention, isolated proteins or 
5 polypeptides are provided comprising an AD4 gene product, as well as AD4 peptides of 
greater than 12, 13, or 20 amino acids. Within one embodiment, a protein is provided 
that has the amino acid sequence set forth in Figure 2. Within another embodiment, the 
protein is a mutant ADA gene product that increases the probability of Alzheimer*s 
Disease. Such mutants include those witli an amino acid subsitution at residue 141 

10 (e.g., an aspargine to isoleucine substitution). Within yet a further embodiment, AD4 
peptides are provided which are composed of 1 3 to 20 amino acids derived or selected 
from the N -terminal, internal, or carboxy-terminal hydrophilic regions. 

Within yet another aspect of the present invention, methods of treating 
or preventing Alzheimer's Disease are provided, comprising the step of administering to 

1 5 a patient a vector containing or expressing a nucleic acid molecule as described above, 
thereby reducing the likelihood or delaying the onset of Alzheimer's Disease in the. 
patient. Within a related aspect, methods of treating or preventing Alzheimer's Disease 
are provided, comprising the step of administering to a patient a protein as described 
above, thereby reducing the likelihood or delaying the onset of Alzheimer's Disease in 

20 the patient. Within yet another related aspect, methods arc provided for treating or 
preventing Alzheimer's Disease comprising the step of administering to a patient an 
antibody specific for an AD4 protein as described above, thereby reducing the 
likelihood or delaying the onset of Alzheimer's Disease in the patient. Within certain 
embodiments, the above methods may be accomplished by in vivo administration. 

25 Also provided by the present invention are pharmaceutical compositions 

comprising a nucleic acid molecule, vector, host cell, protein, or antibody £is described 
above, along with a pharmaceutical ly acceptable earner or diluent. 

Within other aspects of the present invention, antibodies are provided 
which specifically bind to an AD4 protein, or to unique peptides derived from the N- 

30 terminal, internal, or carboxy-terminal hydrophilic regions. As utilized herein, it should 
be understood that an antibody is specific for an AD4 protein if it binds detectably, and 
with a Ky\ of lO'^M or less {e.g., lO'^M, lO'^M, etc.), but does not bind detectably (or 
with an affinity of greater than lO'^M^ (e.g.. lO'^M, lO'^M, etc.) to the AD3 protein. 
Also provided are hybridomas which are capable of producing such antibodies. 

35 Within other aspects of the present invention, nucleic acid probes are 

provided which are capable of specifically hybridizing (as defined above) to an AD4 
gene under conditions of high stringency. Within one related aspect, such probes 
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comprise at least a portion of the nucleotide sequence shown in Figure 1 or its 
complementary sequence, the probe being capable of specifically hybridizing to a 
mutant AD4 gene under conditions of high stringency. Within one particularly 
preferred aspect, probes are provided that arc capable of sj>ecifically hybridizing to a 
5 mutant AE>4 gene in which amino acid residue 141 is changed from asparagine to 
isoleucine, under conditions of very high stringency. Within other related aspects, 
probe are provided which are capable of specifically hybridizing to at least a portion of 
the nucleic sequence shown in any of Figures 13-19. Representative probes of the 
present invention are generally at least 12 nucleotide bases in length, although they may 

10 be 14, 16, 18 bases or longer. Also provided are primer pairs capable of specifically 
amplifying all or a portion of any of the nucleic acid molecules disclosed herein. 

Within other aspects of the invention, methods arc provided for 
diagnosing a patient having an increased likelihood of contracting Alzheimer's Disease, 
comprising the steps of (a) obtaining from a patient a biological sample containing 

15 nucleic acid, (b) incubating the nucleic acid with a probe which is capable of 
specifically hybridizing to a mutant AD4 gene under conditions and for lime sufficient 
to allow hybridization to occur, and (c) detecting the presence of hybridized probe, and 
thereby determining that said patient has an increased likelihood of contracting 
Alzheimer's Disease. Within another aspect, methods are provided comprising the 

20 steps of (a) obtaining from a patient a biological sample containing nucleic acid, (b) 
amplifying a selected nucleic acid sequence associated with a mutant AD4 gene, and (c) 
detecting the presence of an amplified nucleic acid sequence, and thereby determining 
that the patient has an increased likelihood of contracting Alzheimer's Disease. Within 
yet another aspect, methods are . provided comprising the steps of (a) contacting a 

25 biological sample obtained from a patient with an antibody that specifically binds to a 
mutant AD4 protein, under conditions and for a time sufficient to allow binding of the 
antibody to the protein, and (b) detecting the presence of the bound antibody. Suitable 
biological samples include nucleated cells obtained from the peripheral blood, from 
buccal swabs, or brain tissue, 

30 Within another aspect, peptide vaccines are pro vied which comprise a 

portion of a mutant AD4 gene product containing a mutation, in combination with a 
pharmaceutically acceptable carrier or diluent. 

Within yet another aspect, transgenic animals are provided whose germ 
cells and somatic cells contain an AD4 gene which is operably linked to a promoter 

35 effective for the expression of the gene, the gene being introduced into the animal, or an 
ancestor of the animal, at an embryonic stage. Within one embodiment, the animal is a 
mouse, rat or dog. Within other embodiments, the AD4 gene is expressed from a vector 
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as described above. Within yet another emhodimenU the AD4 gene encodes a mutant 
AD4 gene product. 

These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In addition, 
5 various references are set forth herein which describe in more detail certain procedures 
or compositions (e.g., plasmids, etc.), and are therefore incorporated by reference in 
their entirety. 

Brief Description of the Drawinus 
1 0 Figures I A and 1 B depict the nucleotide sequence of an AD4 gene. 

Figure 2 depicts the amino acid sequence of an AD4 gene product. 
Figure 3 depicts the sequence alignment of a representative AI>4 gene 
product to the AD3 gene product. Vertical bars between the sequences indicate 
sequence identity. Colons between the sequences indicate strong sequence similarity. 
1 5 Dots between the sequences indicate medium sequence similarity. Dots inserted into 
either sequence indicate gaps inserted into a sequence so as to maximize the score of 
the alignment. 

Figure 4 depicts a hydropathy plot of an AD4 gene product. The plot 
indicates that there are 7 putative transmembrane regions of this gene product, 2ind 

20 indicates a mutation at residue 141 (asparagine to isoleucine). 

Figure 5 depicts certain sequence features of a representative AD4 gene 
product, including a conserved sequence of 4 amino acids (KPGL) in positions 202-205 
and 251-254, each of which lie at the C-terminus of a putative membrane spanning 
regions and are conserved in the human and mouse AD3 gene products. Figure 5 also 

25 depicts a putative glycosylaiion site (asparagine 366). 

Figure 6 is a dotplot which compares the human AD3 gene product and 
a representative AD4 gene product. The dark blocks indicate sequence similarity 
between the putative membrane spanning regions. 

Figure 7 depicts a family tree shovidng the segregation of chromosome 1 

30 markers in the R family. The bars beneath each subject represent minimum 
recombinant haplotypes constructed using the 23 markers from D1S238 to DIS235 
(Table 2). Alleles shown (from top to bottom) are for D1S249, D1S237, D1S479. 
D1S439 and D1S225, where A is the largest published allele. The solid bar represents 
the haplotypes segregating with the disease. Other haplotypes are represented by bars 

35 with different hatching or marking. A partially missing bar indicates markers not 
genotyped (subject IV- 15). A " " symbol represents regions where the phase cannot be 
assigned. APOE genotypes are shown directly above the haplotypes with 3 and 4 being 
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the e3 and £4 alleles, respectively. Recombinants were observed on the disease 
haplotypes between D1S479/DIS439 and D1S225 in subjects V-l and V-3, and 
between D1S306 and D1S3I0 in subject IV- 18 (not shown). In IV- 16 there is a 
recombination between DIS217 and D1S237. A total of 4 recombinations (3 on non- 
5 disease related chromosomes) were observed between DIS249 and D1S237 in the 27 
chromosomes in generation IV. The distance between these markers is 12 cM and the 
chance thai up to 4 recombinants would be observed is 41%. 

Figure 8 depicts the location of chromosome I markers. Distances are 
given in centimorgans (sex -averaged) with the use of the Kosambi map function. The 3 
1 0 maps are not independent as all contain some common genotypes. 

Figure 9 sets forth the sequences of primers used to amplify 
polymorphic regions of chromosome 1 . 

Figure 10 presents the DNA sequence of clone 788iih5, EST T03796 
and clones Ell 1.1 and ELL 1.2. 
15 Figure 11 presents the DNA sequences of primers used in PCR or 

sequencing of the AD4 gene sequence. 

Figure 12 presents an elhidium bromide stained gel of genomic DNA 
amplified with WWF and INTIR primers. Lanes are: 1, at-risk, 79 years; 2, affected, 
onset 60 years; 3, at-risk, 67 years of age; 4, affected, onset 75 years (presumed 
20 phenocopy); 5, Boehringer Mannheim DNA size standard V; 6, affected, onset at 52 
years; 7, affected, onset 46 years; 8, affected, onset 49 years. 

Figure 13 presents a partial genomic DNA sequence of AD4. 

Figure 14 presents a partial genomic DNA sequence of AD4. 

Figure 15 presents a partial genomic DNA sequence of AD4. 
25 Figure 1 6 presents a partial genomic DNA sequence of AD4. 

Figure 17 presents a partial genomic DNA sequence of AD4. 

Figure 1 8 presents a partial genomic DNA sequence of AD4. 

Figure 19 presents a partial genomic DNA sequence of AD4. 

Figure 20 shows expression pancm of STM2 in different tissues. 

30 

Detailed Description of the Invention 
Definitions 

Prior to setting forth the invention in detail, it may be helpful to an 
35 understanding thereof to set forth definitions of certain terms and to list and to define 
the abbreviations that will be used hereinafter. 
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"Genetic marker" is any segment of a chromosome that is 
distinguishably unique in the genome, and polymorphic in the population so as to 
provide information about the inheritance of linked DNA sequences, genes and/or other 
markers. 

5 " Autosomal dominant " means that a trait is encoded on one of the non- 

sex chromosomes (autosomes) and is dominant for the phenotype it dictates for an 
individual having a heterozygous state. 

" LOD score " is a standard measure in genetics of the likelihood of a trait 
being localized in the interval being scored. Is the logarithm of a calculated probability. 
10 " Vector " refers to an assembly which is capable of directing the 

expression of an AD4 gene, as well as any additional sequence(s) or gene(s) of interest. 
The vector must include transcriptional promoter elements which are operably linked to 
the genes of interest. The vector may be composed of either deoxyribonucleic acids 
("DNA"), ribonucleic acids ("RNA"), or a combination of the two (e.g., a DNA-RNA 

15 chimeric). Optionally, the vector may include a polyadenylation sequence, one or more 
restriction sites, as well as one or more selectable markers such as neomycin 
phosphotransferase or hygromycin phosphotransferase. Additionally, depending on the 
host cell chosen and the vector employed, other genetic elements such as an origin of 
replication, additional nucleic acid restriction sites, enhancers, sequences conferring 

20 inducibility of transcription, and selectable markers, may also be incorporated into the 
vectors described herein. 

Abbreviations : AD, Alzheimer's disease; APP, amyloid precursor 
protein gene; APLPI and APLP2, amyloid precursor like proteins: CNS, central 
nervous system; DS, Down syndrome; FAD, familial AD; HCHWA-D, hereditary 

25 cerebral hemorrhage viath amyloidosis - Dutch type; (Zjnax)^ maximum LOD score; 
NFPs, neurofibrillary tangles; STRP, short tandem repeal polymorphism; 0, 
recombination fraction; VG, Volga German; AD3, the designation given to the 
chromosome 14 early-onset FAD gene; YAC, yeast artificial chromosome; EST, 
espressed sequence tag; PGR, polymerase chain reaction; RT-PCR, PGR process in 

30 which RNA is first transcribed into DNA at the first step using reverse transcriptase 
(RT); cDNA, any DNA made by copying an RNA sequence into DNA form. 

As noted above, the present invention provides methods and 
compositions for the detection and treatment of Alzheimer's Disease. These methods 
35 and compositions are based upon the discovery of a gene which is located on 
chromosome 1, which when mutated increases the probability of Alzheimer's Disease. 
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This gene, designated AD4, was found to be located on chromosome I at locus lq3J- 
42- 

Allhough one embodiment of an AD4 gene is disclosed in Figure 1, it 
should be understood that the present invention is not so limited. In particular, within 
5 the context of the present invention reference to the AD4 gene should be understood to 
include derivatives, analogs, or allelic variants of the gene disclosed in Figure I that are 
substantially similar. As used herein, a nucleic acid molecule is deemed to be 
"substantially similar" if: (a) the nucleotide sequence is derived from the coding region 
of the described gene and includes portions of the sequence or allelic variations of the 
sequences discussed above; (b) the nucleotide sequence is capable of hybridization to 
nucleotide sequences of the present invention under high or very high stringency (see 
Sambrook et al.. Molecular Cloning: A Laboratory Manual 2nd ed.. Cold Spring 
Harbor Laboratory Press, NY, 1989): or (c) the DNA sequences are degenerate as a 
result of the genetic code to the DNA sequences defined in (a) or (b). In addition, the 
15 genomic sequence may contain only polymorphisms, either in the coding or non- 
translated region, or insertions, deletions and the like. I-'urther, the AD4 gene includes 
both complementary and non-complementary sequences, provided the sequences 
otherwise meet the criteria set forth herein. Within the context of the present invention, 
high stringency means standard hybridization conditions {eg,, 5x SSPE, 0.5% SDS at 
20 65^*0, or the equivalent), such that an appropriate nucleotide sequence is able to 
selectively hybridize to nucleotide sequences from the AD-related gene, and to mutant 
nucleotide sequences. Very high stringency means the nucleotide sequence is able to 
selectively hybridize to a single allele of the AD-relaled gene. 

The AD4 gene may be isolated from genomic DNA or cDNA. Genomic 
25 DNA libraries constructed in vectors, such as YACs (yeast artificial chromosomes), 
bacteriophage vectors, such as XEMBL3, >-gtl0, cosmids or plasmids, are suitable for 
use. cDNA libraries constructed in bacteriophage vectors, plasmids, or others, are 
suitable for screening. Such libraries may be constructed using methods and techniques 
known in the art {see Sambrook et al.. Molecular Cloning: A Laboratory Manual, Cold 
30 Spring Harbor Press, 1989) or purchased from commercial sources {e.g.. Clontech). 
Within one embodiment, the AD4 gene is isolated by PCR performed on genomic 
DNA, cDNA or libraries, or by probe hybridization of genomic DNA or cDNA 
libraries, f rimers for PCR and probes for hybridization screening may be designed 
based on the DNA sequence of AD4 presented herein. The DNA sequence of an AD4 
35 gene is presented in Figure 1 A and B; and the predicted amino acid sequence is 
presented in Figure 2. Primers for PCR should be derived from sequences in the 5' and 
3* untranslated region in order to isolate a full-length cDNA. The primers should not 
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have self-complementary sequences nor have complementary sequences at their 3* end 
(to prevent primer-dimer formation). Preferably, the primers have a GC content of 
about 50% and contain restriction sites. The primers are annealed to cDNA and 
sufficient cycles of PCR are performed to yield a product readily visualized by gel 
5 electrophoresis and staining. The amplified fragment is purified and inserted into a 
vector, such as XgtIO or pBS(M13+), and propagated. An oligonucleotide 
hybridization probe suitable for screening genomic or cDNA libraries may be designed 
based on the sequence provided herein. Preferably, the oligonucleotide is 20-30 bases 
long. Such an oligonucleotide may be synthesized by automated synthesis. The 

10 oligonucleotide may be conveniently labeled at the 5' end with a reporter molecule, 
such as a radionuclide, (e.^., ^^p) biolin. The library is plated as colonies or phage, 
depending upon the vector, and the recombinant DNA is transferred to nylon or 
nitrocellulose membranes. Following denaturation, neutralization, and fixation of the 
DNA to the membrane, the membranes are hybridized with the labeled probe. The 

1 5 membranes are washed and the reporter molecule detected. The hybridizing colonies or 
phage are isolated and propagated. Candidate clones or PCR amplified fragments may 
be verified as containing AD4 DNA by any of various means. For example, the 
candidate clones may be hybridized with a second, nonoverlapping probe or subjected 
to DNA sequence analysis. In these ways, clones containing AD4 gene, which arc 

20 suitable for use in the present invention are isolated. 

The structure of the proteins encoded by the nucleic acid molecules 
described herein may be predicted from the primary translation products using the 
hydrophobicity plot function of, for example, P/C Gene or Inlelligenetics Suite 
(Inielligenetics, Mountain View, Calif.), or according to the methods described by Kyte 

25 and Doolilllc {J. MoL Biol. 757:105-132, 1982). One such structure is depicted in 
Figure 4, which is a hydropathy plot of an AD4 gene product. As is evident from this 
figure, the plot depicts a protein with an extracellular domain, three extracellular loop 
domains, three intracellular loop domains, seven transmembrane regions, and an 
intracellular domain. 

3D AD4 proteins of the present invention may be in the form of acidic or 

basic salts, or in neutral form. In addition, individual amino acid residues may be 
modified by oxidation or reduction. Furthermore, various substitutions, deletions, or 
additions may be made to the amino acid or nucleic acid sequences, the net effect of 
which is to retain or further enhance or decrease the biological activity of the mutant or 

35 wild-type protein. Moreover, due to degeneracy in the genetic code, there may be 
considerable variation in nucleotide sequences encoding the same amino acid sequence. 
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Other derivatives of the AIM proteins disclosed herein include 
conjugates of the proteins along with other proteins or polypeptides. This may be 
accomplished, for example, by the synthesis of N-terminal or C-terminal fusion 
proteins \^ich may be added to facilitate purification or identification of AD4 proteins 
5 (see U.S. Patent No. 4,851,341, see also, Hopp et al., Bio/Technoiogy <5:1204, 1988.) 
Alternatively, fusion proteins such as AD4-Pgalactosidase or AD4-luciferase may be 
constructed in order to assist in the identification, expression, and analysis of AD4 
proteins. 

AD4 proteins of the present invention may be constructed using a wide 
10 variety of techniques including for example, that described in Example 5. Further, 
mutations may be introduced at particular loci by synthesizing oligonucleotides 
containing a mutant sequence, flanked by restriction sites enabling ligation to fragments 
of the native sequence. Following ligation, the resuIUng reconstructed sequence 
encodes a derivative having the desired amino acid insertion, substitution, or deletion. 
* ^ Alternatively, oligonucleotide-directed site-specific (or segment 

specific) mutagenesis procedures may be employed to provide an altered gene having 
particular codons altered according to the substitution, deletion, or insertion required. 
Exemplary methods of making the alterations set forth above are disclosed by Walder 
etal. (Gene ^2:133, 1986); Bauer etal. (Gene 37:73, 1985); Craik (BioTechniques. 

20 January 1985, 12-19); Smith etal. (Genetic Engmeermg: Principles and Methods. 
Plenum Press, 1981); and Sambrook et al. (supra). Deletion or truncation derivatives 
of AD4 proteins (e,g. a soluble extracellular portion) may also be constructed by 
utilizing convenient restriction endonuclease sites adjacent to the desired deletion. 
Subsequent to restriction, overhangs may be filled in, and the DNA relegated. 

25 Exemplary methods of making the alterations set forth above are disclosed by 
Sambrook et al. (Molecular Cloning: A Laboratory Manual^ 2d Ed., Cold Spring 
Harbor Laboratory Press, 1989). 

Mutations, which are made in the nucleic acid molecules of the present 
invention preferably preserve the reading frame of the coding sequences. Furthermore, 

30 the mutations will preferably not create complementary regions that could hybridize to 
produce secondary mRNA structures, such as loops or hairpins, that would adversely 
affect translation of the mRNA, Although a mutation site may be predetermined, it is 
not necessary that the nature of the mutation per se be predetermined. For example, in 
order to select for optimum characteristics of mutants at a given site, random 

35 mutagenesis may be conducted at the target codon and the expressed mutants screened 
for indicative biological activity. Alternatively, mutations may be introduced at 
particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked 
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by restriction sites enabling ligation to fragments of the native sequence. Following 
ligation, the resulting reconstructed sequence encodes a derivative having the desired 
amino acid insertion, substitution, or deletion. 

AD4 proteins may also be constructed utilizing techniques of PCR 
5 mutagenesis, chemical mutagenesis (Drinkwater and Klinedinst, PNAS ^J:3402-3406, 
1986), by forced nucleotide misincorporation (e.g., Liao and Wise Gene S8:\07-\ IK 
1990), or by use of randomly mutagenized oligonucleotides (Horwitz et al.. Genome 
i:l 12-117, 1989). Particularly preferred methods for constructing Alzheimer Disease 
Proteins are set forth in more detail below in the Examples. 

10 Proteins can be isolated by, among other methods, culturing suitable 

host and vector systems to produce the recombinant translation products of the present 
invention. Supematants from such cell lines, or protein inclusions or whole cells where 
the protein is not excreted into the supernatant, can then be treated by a variety of 
purification procedures in order to isolate the desired proteins. For example, the 

15 supernatant may be first concentrated using commercially available protein 
concentration filters, such as an Amicon or Millipore Pellicon ultrafiltration unit. 
Following concentration, the concentrate may be applied to a suitable purification 
matrix such as, for example, an anti-protein antibody bound to a suitable support. 
Alternatively, anion or cation exchange resins may be employed in order to purify the 

20 protein. As a further aUemative, one or more reverse-phase high performance liquid 
chromatography (RP-HPLC) steps may be employed to further purify the protein. 
Other methods of isolating the proteins of the present invention are well known in the 
skill of the art. 

A protein is deemed to be "isolated" within the context of the present 
25 invention if no other (undesired) protein is detected pursuant to SDS-PAGE analysis 
followed by Coomassie blue staining. Within other embodiments, the desired protein 
can be isolated such that no other (undesired) protein is detected pursuant to SDS- 
PAGE analysis followed by silver staining. 

The present invention also provides for the manipulation and expression 
30 of the above described genes by culturing host cells containing a vector capable of 
, expressing the above-described genes. Such vectors or vector constructs include either 
synthetic or cDNA-derived nucleic acid molecules encoding AD4 proteins, which are 
operably linked to suitable transcriptional or translational regulatory elements. Suitable 
regulatory elements may be derived from a variety of sources, including bacterial. 
35 fungal, viral, manunalian, insect, or plant genes. Selection of appropriate regulatory 
elements is dependent on the host cell chosen, and may be readily accomplished by one 
of ordinary skill in the art. Examples of regulatory elements include: a transcriptional 
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promoter and enhancer or RNA polymerase binding sequence, a transcriptional 
terminator, and a ribosomal binding sequence, including a translation initiation signal. 

Nucleic acid molecules that encode any of the AD4 proteins described 
above may be readily expressed by a wide variety of prokaryoiic and cukaryoiic host 
5 cells, including bacterial, mammalian, yeast or other fungi, viral, insect, or plant cells. 
Methods for transforming or transfecting such cells to express foreign DNA are well 
known in the art (see, e.g.. llakura et aL, U.S. Patent No. 4,704,362; Hinnen el al., Froc. 
Natl Acad ScL USA 75:1929-1933, 1978; Murray etaL, U.S. Patent No. 4,801.542; 
Upshall ct al., U.S. Patent No. 4,935,349; Ilagcn ct al., U.S. Patent No. 4,784,950; Axel 
10 etal., U.S. Patent No, 4,399,216; Goeddel etal., U.S. Patent No. 4,766,075; and 
Sambrook et al. Molecular Cloning: A Laboratory ManuaL 2nd edition. Cold Spring 
Harbor Laboratory Press, 1989; for plant cells see Czako and Marlon, Plant Physiol. 
/r>-/: 1067- 1071, 1994; and Paszkowski et aL. Biotech 2^:387-392, 1992). 

Bacterial host cells suitable for carrying out the present invention 
15 include E. colt, B. subtilis. Salmonella typhimuhum, and various species within the 
genus* Pseudomonas. Streptomyces, and Staphylococcus, as well as many other 
bacterial species well known to one of ordinary skill in the art. Representative 
examples of bacterial host cells include DH5a (Stratagene, LaJolIa, California). 

Bacterial expression vectors preferably comprise a promoter which 
20 functions in the host cell, one or more selectable phenotypic markers, and a bacterial 
origin of replication. Representative promoters include die P-lactamase (penicillinase) 
and lactose promoter system {see Chang etal. Nature 275:615, 1978), the T7 RNA 
polymerase promoter (Siudier el al., Meth. EnzymoL /6fi:60-89, 1990), the lambda 
promoter (Elvin et al.. Gene 57:123-126, 1990), the trp promoter (Nichols and 
25 Yanofsky, Metk in Enzymology 10 1983) and the tac promoter (Russell etal.. 
Gene 20: 231, 1982). Representative selectable markers include various antibiotic 
resistance markers such as the kanamycin or ampicillin resistance genes. Many 
plasmids suitable for transforming host cells are well known in the art, including among 
others, pBR322 {see Bolivar eta!.. Gene 2:95, 1977), the pUC plasmids pUClS, 
30 pUC19, pUC118, pUCn9 {see Messing, Meth. in Enzymology 707:20-77, 1983 and 
Vicira and Messing, Gene / 9:259-268, 1982), and pNH8A, pNH16a, pNHlSa, and 
Bluescript Ml 3 (Stratagene, La JoUa, Calif.). 

Yeast and fungi host cells suitable for carrying out the present invention 
include, among others, Saccharomyces pombe, Saccharomyces cerevisiae^ the genera 
35 Pichia or Kluyveromyces and various species of the genus Aspergillus (McKnight et al., 
U.S. Patent No. 4,935,349). Suitable expression vectors for yeast and fungi include, 
among others, YCp50 (ATCC No. 37419) for yeast, and the amdS cloning vector pV3 
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(Tumbull, Bio/Technology 7:169, 1989), YRp7 (Struhl etal., Proc. Natl, Acad ScL 
USA 76:1035-1039, 1978), YEpl3 (Broach etal.. Gene ^:121-133, 1979), pJDB249 
and pJDB219 (Beggs, Nature 275:104-108, 1978) and derivatives thereof- 
Preferred promoters for use in yeast include promoters from yeast 
5 glycolytic genes (Hitzeman etal., J. Biol. Chem, 255:12073-12080, 1980; Alber and 
Kawasaki, J. MoL Appl Genet 7:419-434, 1982) or alcohol dehydrogenase genes 
(Young et aL, in Genetic Engineering of Microorganisms for Chemicals, IloUaendcr 
el al. (eds.), p. 355, Plenum, New York, 1982; Ammerer, Meth. Enzymoi y(?/:192-20K 
1 983), Examples of useful promoters for fungi vectors include those derived from 
10 Aspergillus nidulans glycolytic genes, such as the adhS promoter (McKnight etal., 
EMBOJ, ^:2093-2099, 1985). The expression units may also include a transcriptional 
terminator. An example of a suitable terminator is the adhS terminaior (McKnight 
et aL, ?7^/£/., -1985). 

As with bacteria! vectors, the yeast vectors will generally include a 
15 selectable marker, which may be one of any number of genes that exhibit a dominant 
phenotype for which a phenotypic assay exists to enable trans formants to be selected. 
Preferred selectable markers are those that complement host cell auxotrophy, provide 
antibiotic resistance or enable a cell to utilize specific carbon sources, and include leu2 
(Broach etal., ibid.), ura3 (Botstein etal.. Gene 8:\7, 1979), or his3 (Struhl etal., 
20 ibid.y Another suitable selectable marker is the cat gene, which confers 
chloramphenicol resistance on yeast cells. 

Techniques for transforming fungi are well known in the literature, and 
have been described, for instance, by Beggs (ibid.), Hinnen et al. {Proc. Natl. Acad. Sci, 
USA 75:1929-1933, 1978), Yelton etal, (Proc. Natl. Acad Sci. USA 81:1140-1747, 
25 1984), and Russell {Nature it?;: 167-169, 1983). The genotype of the host cell may 
contain contain a genetic defect that is complemented by the selectable marker present 
on the expression vector. Choice of a particular host and selectable marker is well 
within the level of ordinary skill in the art. 

Protocols for the transformation of yeast are also well known to those of 
30 ordinary skill in the art. For example, transformation may be readily accomplished 
either by preparation of spheroplasts of yeast with DNA (see Hinnen et al„ PNAS USA 
75:1929, 1978) or by treatment with alkaline sails such as LiCl (see lloh etal., J. 
Bacteriology 753:163, 1983). Transformation of fungi may also be carried out using 
polyethylene glycol as described by Cullen et al. (Bio/Technology 5:369, 1987). 
35 Viral vectors include those which comprise a promoter that directs the 

expression of an isolated nucleic acid molecule that encodes an Alzheimer Disease 
Protein as described above. A wide variety of promoters may be utilized within the 
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context of the present invention, including for example, promoters such as MoMLV 
LTR, RSV LTR, Friend MuLV LTR, Adenoviral promoter (Ohno et al, Science 265: 
781-784, 1994), neomycin phosphotransferase promoter/enhancer, late parvovirus 
promoter (Koering etal., Hum. Gene Therap, 5:457-463, 1994), Herpes TK promoter, 
5 SV40 promoter, metailothionein Ila gene enhancer/promoter, cytomegalovirus 
immediate early promoter, and the cytomegalovirus immediate late promoter. Within 
particularly preferred embodiments of the invention, the promoter is a tissue-specific 
promoter (see e.g., WO 91/02805; EP 0,415,731; and WO 90/07936). Representative 
examples of suitable tissue specific promoters include neural specific enolase promoter, 

10 platelet derived growth factor beta promoter, bone morpho-gcneuc protein promoter, 
human alpha 1 -chimaerin, promoter, synapsin I promoter and synapsin 11 promoter. In 
addition to the above-noted promoters, other viral-specific promoters (e.g., retroviral 
promoters (including those noted above, as well as others such as HIV promoters), 
hepatitis, herpes (e.g., EBV), and bacterial, fungal or parasitic (e.g., malarial) -specific 

15 promoters may be utilized in order to target a specific cell or tissue which is infected 
with a virus, bacteria, fungus or parasite. 

Thus, AD4 proteins of the present invention may be expressed from a 
variety of viral vectors, including for example, herpes viral vectors (e.g., U.S. Patent 
No. 5,288,641), adenoviral vectors {e.g, WO 94/26914, WO 93/9191; Kolls et al., 

20 PNAS 9Hl):2l5-2\% 1994; Kass-Eisler ei al., PNAS 90{24):] 1498-502, 1993; Guzman 
et al.. Circulation ^<?(6):2838-48, 1993; Guzman et al.. Cir. Res. 7J(6): 1202- 1207, 
1993; Zabner et aL, Cell 75(2):207-216, 1993; Li et a!.. Hum Gene Ther. ^(4):403-409, 
1993; Caillaud et al., Eur J. Neurosci. 5(10:1287-1291, 1993; Vincent et al., Nat. 
Genet i(2):130-134, 1993: Jaffe etal, ATa/. Genet. y(5):372-378, 1992; and Levrero et 

25 al. Gene / 07(2): 195-202, 1991), adeno-associated viral vectors (WO 95/13365; Flotte 
et al., FNAS P(?(22):10613-10617, 1993), baculovirus vectors, parvovirus vectors 
(Koering et al.. Hum. Gene Therap. 5:457-463, 1994), pox virus vectors (Panicali and 
Paoletti, PNAS 7^:4927-4931, 1982; and Ozaki et al.. Biochenu Biophys, Res. Comm. 
ypJ(2):653-660, 1993), and retroviruses (e.g.. EP 0,415,731; WO 90/07936; WO 

30 91/0285, WO 94/03622; WO 93/25698; WO 93/25234; U.S. Patent No. 5,2 1 9,740; WO 
93/1 1230; WO 93/10218. Viral vectors may likewdse be constructed which contain a 
mixture of different elements (e.g., promoters, envelope sequences and the like) firom 
different viruses, or non-viral sources. Within various embodiments, either the viral 
vector itself, or a viral particle which contains the viral vector may be utilized in the 

35 methods and compositions described below. 

Mammalian cells suitable for carrying out the present invention include, 
among others:. PC 12 (ATCC No. CRL1721), N IE- 11 5 neuroblastoma. SK-N-BE(2)C 
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neuroblastoma, SHSY5 adrenergic neuroblastoma, NS20Y and NG108-15 murine 
cholinergic eel! lines, or rat F2 dorsal root ganglion line, COS (e.^r., ATCC No. CRL 
1650 or 1651), BHK {e.g., ATCC No. CRL 6281; BHK 570 cell line (deposited with 
the American Type Culture Collection under accession number CRL 10314), CHO 
5 (ATCC No. CCL 61), HeLa (e.^., ATCC No. CCL 2), 293 (ATCC No. 1573; Graham 
et a\..J. Gen Virol. i6:59-72, 1977) and NS-1 cells. Other mammalian cell lines may 
be used within the present invention, including Rat Hep 1 (ATCC No. CRL 1600), Ral 
Hep II (ATCC No. CRL 1548), TCMK (ATCC No. CCL 139), Human lung (ATCC 
No. CCL 75.1), Human hepatoma (ATCC No. lITn.52), Hep G2 (ATCC No. HB 

10 8065), Mouse liver (ATCC No. CCL 29.1), NCTC 1469 (ATCC No. CCL 9, 1 ), SP2/0- 
Agl4 (ATCC No. 1581). HIT-T15 (ATCC No. CRL 1777), and RINm 5AHT2B 
(Orskov and Nielson, FEBS 229{\): 1 75-1 78, 1 988). 

Mammalian expression vectors for use in carrying out the present 
invention will include a promoter capable of directing the transcription of a cloned gene 

1 5 or cDN A. Preferred promoters include viral promoters and cellular promoters. Viral 
promoters include the cytomegalovirus immediate early promoter (Bosharl et al.. Cell 
^7:521-530, 1985), cytomegalovirus immediate late promoter, SV40 promoter 
(Subramani etaL, Mol. Cell Bioi 7:854-864. 1981), MMTV LTR, RSV LTR, 
metallothionein-1, adenovirus El a. Cellular promoters include the mouse 

20 metallothionein-1 promoter (Palmiter elal., U.S. Patent No. 4,579,821), a mouse V,^ 
promoter (Bergman et al., Proc. Noll Acad. ScL USA (?/:704 1-7045, 1983; Grant et al., 
NucL Acids Res. 75:5496, 1987) and a mouse Vh promoter (Loh et al.. Cell Ji:85-93, 
1983). The choice of promoter will depend, at least in part, upon the level of 
expression desired or the recipient cell line to be trans fee ted. 

25 Such expression vectors may also contain a set of RNA splice sites 

located downstream from the promoter and upstream from the DNA sequence encoding 
the peptide or protein of interest. Preferred RNA splice sites may be obtained from 
adenovirus and/or immunoglobulin genes. Also contained in the expression vectors is a 
polyadenylation signal located downstream of the coding sequence of interest. Suitable 

30 polyadenylation signals include the early or late polyadenylation signals from SV40 
(Kaufman and Sharp, ibid.), the polyadenylation signal from the Adenovirus 5 ElB 
region and the human growth hormone gene terminator (DeNoto et al., Nuc. Acid^ Res. 
9:3719-3730, 1981). The expression vectors may include a noncoding viral leader 
sequence, such as the Adenovirus 2 tripartite leader, located between the promoter and 

35 the RNA splice sites. Preferred vectors may also include enhancer sequences, such as 
the SV40 enhancer and the mouse Ig heavy chain enhancer (Gillies, Cell ii:71 7-728. 
1983). Expression vectors may also include sequences encoding the adenovirus VA 
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RN As. Suitable expression vectors can be obtained from commercial sources (e.g., 
Stratagcne, La Jolla. Calif.). 

Vector constructs comprising cloned DNA sequences can be introduced 
into cultured mammalian cells by, for example, calcium phosphate-mediated 
5 transfection (Wigler et al.. Cell 14:125, 1978; Corsaro and Pearson, Somatic Cell 
Genetics 7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973). elecux)poration 
(Neumann ctal., EMBO 1 /:841-845, 1982), or DEAE-dextran mediated transfection 
(Ausubel et al. (eds.), Current Protocols in Molecular Biology^ John Wiley and Sons, 
Inc., NY, 1987). To identify cells that have stably integrated the cloned DNA. a 

10 selectable marker is generally introduced into the cells along with the gene or cDNA of 
interest. Preferred selectable markers for use in cultured mammalian cells include 
genes that confer resistance to drugs, such as neomycin, hygromycin, and methotrexate. 
The selectable marker may be an ampiifiable selectable marker. Preferred amplifiable 
selectable markers are the DHFR gene and the neomycin resistance gene. Selectable 

1 5 markers are reviewed by Thilly {Mammalian Cell Technology. Butterworth Publishers, 
Stoneham, MA, which is incorporated herein by reference). 

Mammalian cells containing a suitable vector are allowed lo grow for a 
period of time, typically 1-2 days, to begin expressing the DNA sequence(s) of interest. 
Drug selection is then applied to select for growth of cells that are expressing the 

20 selectable marker in a stable fashion. For cells that have been transfected with an 
amplifiable, selectable marker the drug concentration may be increased in a stepwise 
manner to select for increased copy number of the cloned sequences, thereby increasing 
expression levels. Cells expressing the introduced sequences are selected and screened 
for production of the protein of interest in the desired fomi or at the desired level. Cells 

25 that satisfy these criteria can then be cloned and scaled up for production. 

Protocols for the transfection of mammalian cells are well known to 
those of ordinary skill in the art. Representative methods include calcium phosphate 
mediated transfection, electroporalion, lipofection, retroviral, adenoviral and protoplast 
fusion-mediated transfection {see Sambrook et al., supra). 

30 Numerous insect host cells known in the art can also be useful within the 

present invention, in light of the subject specification. For example, the use of 
baculoviruses as vectors for expressing heterologous DNA sequences in insect cells has 
been reviewed by Atkinson et al. {Pestic. ScL 2^f:2 15-224, 1990). 

Numerous plant host cells known in the art can also be useful within the 

35 present invention, in light of the subject specification. For example, the use of 
Agrobacterium rhizogenes as vectors for expressing genes in plant cells has been 
reviewed by Sinkar et al., (J BioscL (Bangalore) / 7:47-58, 1987). 



BNSCXXIOr <WO 97031 92A3 1A> 




wo 97/03192 PCT/US96/11386 

19 

AD4 proteins may be prepared by growing (typically by culluring) the 
host/vector systems described above, in order to express the recombinant Alzheimer 
Disease Proteins. Recombinantly produced AEM proteins may be further purified as 
described in more detail below. 
5 Within related aspects of the present invention, AD4 proteins may be 

expressed in a transgenic animal whose germ cells and somatic cells contain an AD4 
gene which is operably linked to a promoter effective for the expression of the gene 
may also be expressed in non-human transgenic animals such as mice, rats, rabbits, 
sheep, dogs and pigs (sec Hammer etal. Nature j;i:680-683, 1985, Palmiter etaL 

10 Science 222:809-814, 1983, Brinster etal. Proc\ Natl. Acad Sci. USA ^2:4438-4442. 
1985, Palmiter and Brinster Cell ^7:343-345, 1985 and U.S. Patent Nos. 5,175,383, 
5,087,571, 4,736,866, 5,387,742, 5.347,075, 5,221,778, and 5,175,384). Briefly, an 
expression vector, including a nucleic acid molecule to be expressed together with 
appropriately positioned expression control sequences, is introduced into pronuclei of 

15 fertilized eggs, for example, hy microinjection. Integration of the injected DNA is 
detected by blot analysis of DNA from tissue samples. It is preferred that the 
introduced DNA be incorporated into the germ line of the animal so that it is passed on 
to the animal's progeny. Tissue-specific expression may be achieved through the use of 
a tissue-specific promoter, or through the use of an inducible promoter, such as the 

20 melallothionein gene promoter (Palmiter el al., 1983, ihid). which allows regulated 
expression of the iransgene. 

Vectors of the present invention may contain or express a wide variety 
of additional nucleic acid molecules in place of or in addition to an AD4 protein as 
described above, either from one or several separate promoters. For example, the viral 

25 vector may express a lymphokine or lymphokine receptor, antisense or ribozyme 
sequence or toxins. Representative examples of lymphokines include IL-I, lL-2, IL-3. 
IL-4, fL-5, IL-6, IL-7, lL-8, IL-9, IL-10, IL-1 1, IL-12, IL-13, IL-14, lL-15, GM-CSF, 
G-CSF, M-CSF, alpha-inierferon, bela-interferon, gamma interferon, and tumor 
necrosis factors, as well as their respective receptors. Representative examples of 

30 antisense sequences include antisense sequences which block the expression of AD4 
protein mutants. Representative examples of toxins include: ricin, abrin, diphtheria 
toxin, cholera toxin, saporin, gelonin, pokeweed antiviral protein, Iritin, Shigella toxin, 
and Pseudomonas exotoxin A. 

Within other aspects of the invention, antisense oligonucleotide 

35 molecules are provided which specifically inhibit expression of mutant AD4 nucleic 
acid sequences {see generally, Hirashima et al. in Molecular Biology of RNA: New 
Perspectives (M. Inouye and B. S. Dudock, eds., 1987 Academic Press, San Diego, p. 
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401); Oligonucleotides: Aniisense Inhibitors of Gene Expression (J.S. Cohen, ed., 1989 
MacMillan Press, London); Stein and Cheng, Science 267:1004-1012 (1993); WO 
95/10607; U.S. 5,359,051; WO 92/06693; and EP-A2-6 12844). Briefly, such 
molecules are constructed such that they are complementary to, and able to form 
5 Watson-Crick base pairs with, a region of transcribed AD4 mutant mllNA sequence 
containing an AD4 mutation. The resultant double-stranded nucleic acid interferes with 
subsequent processing of the mRNA, thereby preventing protein synthesis. 

Within other related aspects of the invention, ribozyme molecules arc 
provided wherein an antisense oligonucleotide sequence is incorporated into a ribozyme 

10 which can specifically cleave mRNA molecules transcribed from a mutant AD4 gene 
{see generally, Kim et al. Proc. Nat. Acad. Sci. USA 84:^7S2 (1987); Haseloff/et al. 
Nature 234:5S5 (1988), Cech, JAMA 260:3030 (1988); Jeffries, el al. Nucleic Acids 
Res. 77:1371 (1989); U.S. 5,093,246; U.S. 5,354,855; U.S. 5,144,019; U.S. 5,272,262; 
U.S. 5,254,678; and U.S. 4,987,071). According to this aspect of the invention, the 

15 antisense sequence which is incorporated into a ribozyme includes a sequence 
• complementary to, and able to form Watson-Crick base pairs with, a region of the 
transcribed mutant AD4 mRNA containing an AD4 mutation. The antisense sequence 
thus becomes a targeting agent for delivery of catalytic ribozyme activity specifically to 
mutant AD4 mRNA, where such catalytic activity cleaves the mRNA to render it 

20 incapable of being subsequently processed for AD4 protein translation. 



Host Cells 

25 As discussed above, nucleic acid molecules which encode the Ab4 

proteins of the present invention (or the vectors which contain and/br express related 
mutants) may readily be introduced into a wide variety of host cells. Representative 
examples of such host cells include plant cells, eukaryotic cells, and prokaryotic cells. 
Within preferred embodiments, the nucleic acid molecules are introduced into cells 

30 from a vertebrate or warm-blooded animal, such as a human, macaque, dog, cow. horse, 
pig, sheep, rat, hamster, mouse or fish cell, or any hybrid thereof. 

Preferred prokaryotic host cells for use within the present invention 
include E. coil. Salmonella, Bacillus, Shigella. Pseudomonas, Streptomyces and other 
genera. Techniques for transforming these hosts and expressing foreign DNA 

35 sequences cloned therein are well knov^m in the art {see, e.g.. Maniatis et al.. Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 1982, which is 
incorporated herein by reference; or Sambrook etal., supra). Vectors used for 
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expressing cloned DNA sequences in bacterial hosts will generally contain a selectable 
marker, such as a gene for antibiotic resistance, and a promoter that functions in the 
host cell. Appropriate promoters include the irp (Nichols and Yanofsky, Metk 
Enzymol 7(?/: 155- 1 64, 1983), /^c (Casadaban et a!., J. Baaeriol 1 43:91] -9^0. 1980), 
5 and phage k (Queen, ./ Mol. AppL Genet. 2:1-10, 1983) promoter systems. Plasmids 
useful for transforming bacteria include the pUC plasmids (Messing, Metk Enzymol 
yO/:20-78, 1983; Vieira and Messing, Ge/ic /P:259-268, l982),pBR322 (Bolivar et al., 
Gene 2:95-1 13, 1977), pCQV2 (Queen, ibidX and derivatives thereof. Plasmids may 
contain both viral and bacterial elements. 

10 Preferred eukaryotic cells include cultured manmialian cell lines {e.g.. 

rodent or human cell lines) and fungal cells, including species of yeast {e.g., 
Saccharomyces spp„ particularly S. cerevisiae^ Schizosaccharomyces spp., or 
Kluyveromyces spp.) or filamentous fungi (e.g., Aspergillus spp., Neurospora spp.). 
Strains of the yeast Saccharomyces cerevisiae are particularly preferred. Methods for 

1 5 producing recombinant proteins in a variety of prokaryotic and eukaryotic host cells are 
generally known in the art {see, "Gene Expression Technology/' Methods in 
EnzymoIog}\ Vol. 185, Goeddel (ed.). Academic Press, San Diego, Calif., 1990; see 
also, "Guide to Yeast Genetics and Molecular Biology," Methods in Enzymolog}\ 
Guthrie and Fink (cds.). Academic Press, San Diego, Calif., 1991). In general, a host 

20 cell will be selected on the basis of its ability to produce the protein of interest at a high 
level or its ability to carry out at least some of the processing steps necessary for the 
biological activity of the protein. In this way, the number of cloned DNA sequences 
that must be introduced into the host cell can be minimized and overall yield of 
biologically active protein can be maximized . 

25 The nucleic acid molecules (or vectors) may be introduced into host 

cells by a wide variety of mechanisms, including for example calcium phosphate- 
mediated transfection (Wigler et al.. Cell 14:125, 1978), lipofection; gene gun (Corsaro 
and Pearson, Somatic Cell Gen. 7:603, 1981; Graham and Van der Eb, Plrology 52:456, 
1973), electroporation (Neumann et al., EMBO J. 7:841-845, 1982), retroviral, 

30 adenoviral, protoplast fusion-mediated transfection or DEAE-dextran mediated 
transfection (Ausubel et al„ (eds.). Current Protocols in Molecular Biology-. John 
Wiley and Sons, Inc., NY, NY, 1987). 

Host cells containing vector constructs of the present invention are then 
cultured to express a DNA molecule as described above. The cells are cultured 

35 according to standard methods in a culture medium containing nutrients required for 
growth of the chosen host cells. A variety of suitable media are known in the iart and 
generally include a carbon source, a nitrogen source, essential amino acids, vitamins 



BNSDOCID: <W O 9703192A3 IA> 



wo 97/03192 PCT/US96/1 1386 

22 

and minerals, as well as other components, e.g., growth factors or serum, that may be 
required by the particular host cells. The growth medium will generally select for cells 
containing the DNA construct(s) by, for example, drug selection or deficiency in an 
essential nutrient which is complemented by the selectable marker on the DNA 
5 constnicl or co-transfccied with the DNA construct. 

Suitable growth conditions for yeast cells, for example, include culturing 
in a chemically defined medium, comprising a nitrogen source, which may be a non- 
amino acid nitrogen source or a yeast extract, inorganic salts, vitamins and essential 
amino acid supplements at a temperature between 4°C and BT^'C, with 30°C being 

10 particularly preferred. The pH of the medium is preferably maintained at a pH greater 
than 2 and less than 8, more preferably pH 5-6. Methods for maintaining a stable pH 
include bufTering and constant pH control. Preferred agents for pH control include 
sodium hydroxide. Preferred buffering agents include succinic acid and Bis-Tris 
{Sigma Chemical Co., Si. Louis, Mo.). Due to the tendency of yeasl host cells to 

1 5 hj^erglycosylate heterologous proteins, it may be preferable to express the nucleic acid 
molecules of the present invention in yeast cells having a defect in a gene required for 
asparagine-l inked glycosylation. Such cells are preferably grown in a medium 
containing an osmotic stabilizer. A preferred osmotic stabilizer is sorbitol 
supplemented into the medium at a concentration between 0.1 M and 1.5 M, preferably 

20 at0.5Mor l.OM. 

Cultured mammalian cells are generally cultured in commercially 
available serum-containing or serum-free media. Selection of a medium and growth 
conditions appropriate for the particular cell line used is well within the level of 
ordinary skill in the art. 

25 

ANTIBODIES 

Antibodies to the AD4 proteins discussed above may readily be prepared 
given the disclosure provided herein. Such antibodies may, within certain 
embodiments, specifically recognize wild-type AD4 protein rather than mutant AD4 

30 protein, mutant AD4 protein rather than wild-type AD4 protein, or equally recognize 
both wild-type and mutant AD4 protein. Within the context of the present invention, 
antibodies are understood to include monoclonal antibodies, polyclonal antibodies, anti- 
idiotypic antibodies^ antibody firagraenls {e.g.. Fab, and F{ab*)2, Fy variable regions, or 
complementarity determining regions). As discussed above, antibodies are understood 

35 to be specific against an AD4 protein if it binds with a Ka of greater than or equal to 
I0"^M, preferably greater than of equal to 10*^M. The affinity of a monoclonal 
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antibody or binding partner can be readily determined by one of ordinary skill in the art 
(see Scaichaid, Ann, N.Y. Acad ScL J/:660-672, 1949). 

Briefly, polyclonal antibodies may be readily generated by one of 
ordinary skill in the art from a variety of warm-blooded animals such as horses, cows. 
5 various fowl, rabbits, mice, or rats. Typically, an AD4 protein or unique ADA peptide 
of 13-20 amino acids (preferably conjugated to keyhole limpet hemocyanin by cross- 
linking with glutaraldehyde) is utilized to immunize the animal through intraperitoneal, 
intramuscular, intraocular, or subcutaneous injections, an adjuvant such as Freund's 
complete or incomplete adjuvant. Following several booster immunizations, samples 

10 of serum are collected and tested for reactivity to the AD4 protein. Particularly 
preferred polyclonal antisera will give a signal on one of these assays that is ai least 
three times greater than background. Once the titer of the animal has reached a plateau 
in terms of its reactivity to the protein, larger quantities of antisera may be readily 
obtained either by weekly bleedings, or by exsanguinating the animal. 

15 Monoclonal antibodies may also be readily generated using conventional 

techniques {see U.S. Patent Nos. RE 32,011, 4,902,614, 4,543,439, and 4,411,993 
which are incorporated herein by reference; see also Monoclonal Antibodies, 
Hybridomas: A New Dimension in Biological Analyses, Plenum Press, Kennett, 
McKeam, and Bechtol (eds.), 1980, and Antibodies: A Laboratory Manual. Harlow and 

20 Lane (eds.). Cold Spring Harbor Laboratory Press, 1988. which are also incorporated 
herein by reference). 

Briefly, within one embodiment a subject animal such as a rat or mouse 
is injected with an AD4 protein or portion thereof as described above. The protein may 
be admixed with an adjuvant such as Freund*s complete or incomplete adjuvant in order 

25 to increase the resultant immune response. Between one and three weeks after the 
initial inununization the animal may be reimmunized with another booster 
immunization, and tested for reactivity to the protein utilizing assays described above. 
Once the animal has reached a plateau in its reactivity to the injected protein, it is 
sacrificed, and organs which contain large numbers of B cells such as the spleen and 

30 lymph nodes are harvested. 

Cells which are obtained from the inmiunized animal may be 
immortalized by transfection with a virus such as the Epstein-Barr virus (EBV) {see 
Glasky and Reading, Hybridoma 8(4)'Ml'l%9, 1989). Alternatively, within a preferred 
embodiment, the harvested spleen and/or lymph node cell suspensions are fused with a 

35 suitable myeloma cell in order to create a "hybridoma" which secretes monoclonal 
antibody. Suitable myeloma lines include, for example, NS-l (ATCC No. TIB 18), and 
P3X63 - Ag 8.653 (ATCC No. CRL 1580). 
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Following the fusion, the cells may be placed into culture plates 
containing a suitable medium, such as RPMI 1640, or DMEM (Dulbecco's Modified 
Eagles Medium) (JRH Biosciences, Lenexa, Kansas), as well as additional ingredients, 
such as Fetal Bovine Serum (FBS, /.e., from Hyclone, Logan, Utah, or JRH 
5 Biosciences). Additionally, the medium should contain a reagent which selectively 
allows for the growth of fused spleen and myeloma cells such as HAT (hypoxanihine, 
aminopterin, and thymidine) (Sigma Chemical Co., St. Louis, Missouri). After about 
seven days, the resulting fused cells or hybridomas may be screened in order to 
delcnninc the presence of antibodies which arc reactive against an AD4 protein. A 

10 wide variety of assays may be utilized to determine the presence of antibodies which 
arc reactive against the proteins of the present invention, including for example 
countercurrent immuno-electrophoresis. radioimmunoassays, radioimmunoprecipita- 
tions, enzyme-linked immuno-sorbent assays (ELISA), dot blot assays. Western blots, 
immunoprecipitation, inhibition or competition assays, and sandwich assays {see U.S. 

15 Patent Nos. 4,376,110 and 4,486,530; see also Aniibodies: A Laboratory ManuaL 
Harlow and Lane (eds.). Cold Spring Harbor Laboratory Press, 1988). Following 
several clonal dilutions and reassays, a hybridoma producing antibodies reactive against 
the AD4 protein may be isolated. 

Other techniques may also be utilized to construct monoclonal 

20 antibodies {see William D. Huse etal., "Generation of a Large Combinational Library 
of the Immunoglobulin Repertoire in Phage Lambda," Science 2-^6:1275-1281, 
December 1989; see also L. Sastry et al., "Cloning of the Immunological Repertoire in 
Escherichia coll for Generation of Monoclonal Catalytic Antibodies: Construction of a 
Heavy Chain Variable Region-Specific cDNA Library," Proc Natl. Acad Sci. USA 

25 ^?6. 5728-5732, August 1989; see a/50 Michelle Alting-Mees etal., "Monoclonal 
Antibody Expression Libraries: A Rapid Alternative to Hybridomas,** Strategies in 
Molecular Biology i:I-9, January 1990; these references describe a commercial system 
available from Stratacyte, La Jolla, California, which enables the production of 
antibodies through recombinant techniques). Briefly, mRNA is isolated from a B cell 

30 population, and utilized to create heavy and light chain immunoglobulin cDNA 
expression libraries in the XlmmunoZapCH) and A.lmmunoZap(L) vectors. These 
vectors may be screened individually or co-expressed to form Fab fragments or 
antibodies {see Huse et al., supra; sec also Sastry et al., supra). Positive plaques may 
subsequently be converted to a non-lytic plasmid which allows high level expression of 

35 monoclonal antibody fragments from E. colt. 

Similarly, portions or fragments, such as Fab or Fv fragments, of 
antibodies may also be constructed utilizing conventional enzj'matic digestion or 
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recombinant DNA techniques to incorporate the variable regions of a gene which 
encodes a specifically binding antibody. Within one embodiment, the genes which 
encode the variable region from a hybridoma producing a monoclonal antibody of 
interest are amplified using nucleotide primers for the variable region. These primers 
5 may be synthesized by one of ordinary skill in the art, or may be purchased from 
commercially available sources. Stratacyie (La Jolia, Calif) sells primers for mouse 
and human variable regions including, among others, primers for V^a, V^b^ ^hc- ^Hd* 
ChI. Vl and Cl regions. These primers may be utilized to amplify heav>' or light chain 
variable regions, which may then be inserted into vectors such as InununoZAP"*^^^ H or 

10 InmiunoZAP™ L (Stratacyte), respectively. These vectors may then be introduced into 
E. coll, yeast, or mammalian-based systems for expression. Utilizing these techniques, 
large amounts of a single-chain protein containing a fusion of the and domains 
may be produced (sec Bird eta!.. Science 242:423-426, 1988). In addition, such 
techniques may be utilized to change a "murine" antibody to a "human" antibody, 

1 5 without altering the binding specificity of the antibody. 

Once suitable antibodies have been obtained, they may be isolated or 
purified by many techniques well known to those of ordinary skill in the art (see 
Antibodies: A Laboratory ManuaL Harlow and Lane (eds.). Cold Spring Harbor 
Laboratory Press, 1988). Suitable techniques include peptide or protein affinity 

20 columns, HPLC or RP-HPLC, purification on protein A or protein G columns, or any 
combination of these techniques. 

Antibodies of the present invention have many uses. For example, 
antibodies may be utilized in flow cytometry to sort cells bearing such an AD4 protein. 
Briefly, in order to detect the protein or peptide of interest on cells, the cells are 

25 incubated with a labeled monoclonal antibody which specifically binds to the protein of 
interest, followed by detection of the presence of bound antibody. These steps may also 
be accomplished with additional steps such as washings to remove unbound antibody. 
Labels suitable for use within the present invention are well known in the art including, 
among others, flourescein isothiocyanale (FITC), phycoerythrin (PE), horse radish 

30 peroxidase (HRF), and colloidal gold. Particularly preferred for use in flow cytometry, 
is FITC, which may be conjugated to purified antibody according to the method of 
Keltkamp in "Conjugation of Fluorescein Isothiocyanale to Antibodies. I. Experiments 
on the Conditions of Conjugation,** Immunology /^:865-873, 1970. (See also 
Keltkamp. "Conjugation of Fluorescein Isothiocyanate to Antibodies. IL A 

35 Reproducible Method," Immunology 7^:875-881, 1970; Coding, "Conjugation of 
Antibodies with Fluorochromes: Modification to the Standard Methods," 1 Immunol. 
Methods 13:2] 5-226. 1970.). 
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Assays 

Assays useful within the conlexl of the present invention include those 
assays for detecting agonists or antagonists of AD4 protein activity. Other assays are 
5 useful for the screening of peptide or organic molecule libraries. Siill other assays are 
useful for the identification and/or isolation of nucleic acid molecules and/or peptides 
within the present invention, or for diagnosis of a patient with an increased likelihood 
of contracting Alzheimer's Disease. 

10 A. Nucleic Acid Based Diagnostic Tests 

Briefly, another aspect of the present invention provides probes and 
primers for detecting the AD4 genes and/or mutants thereof. In one embodiment of this 
aspect, probes are provided that are capable of specifically hybridizing to RNA or DNA 
from the AD4 gene. For purposes of the present invention, probes are "capable of 

15 hybridizing" to AIM genes DNA or RNA if they hybridize to an AD4 gene under 
conditions of either high or moderate stringency (jec^ Sambrook et al.. supra) but not 
significantly or detectably to the AD3 gene. Preferably, the probe may be utilized to 
hybridize to suitable nucleotide sequences under high stringency conditions, such as 5x 
SSPE, Ix Denhardt's solution (Sambrook et al., ii/pra), 0.1% SDS at 65°C and at least 

20 one wash to remove excess probe in the presence of 0.2x SSC, Ix Denhardt's solution, 
0.1% SDS at dS^'C. Except as otherwise provided herein, probe sequences are designed 
to allow hybridization to AD4 genes, but not to DNA or RNA sequences from other 
genes, such as AD3. The probes are used, for example, to hybridize to nucleic acid that 
is present in a biological sample isolated from a patient. 1 he hybridized probe is then 

25 detected, thereby indicating the presence of the desired cellular nucleic acid. 
Preferably, the cellular nucleic acid is subjected to an amplification procedure, such as 
PCR. prior to hybridization. Alternatively, the AD4 gene may be amplified and the 
amplified product subjected to DNA sequencing. Mutants of AD4 may be detected by 
DNA sequence analysis or hybridization with allele-specific oligonucleotide probes 

30 under conditions and for time sufficient to allow hybridization to the specific allele. 
Typically, the hybridization buffer and wash will contain tetramethyl ammonium 
chloride or the like (see Sambrook et al., supra). 

Nucleic acid probes of the present invention may be composed of either 
deoxyribonucleic acids (DNA), ribonucleic acids (RNA), nucleic acid analogues (e.g.. 

35 peptide nucleic acids), or any combination thereof, and may be as few as about 12 
nucleotides in length, usually about 14 to 18 nucleotides in length, and possibly as large 
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as the entire sequence of an AD4 gene. Selection of probe size is somewhat dependent 
upon the use of the probe, and is within the skill of the art. 

Suitable probes can be constructed and labeled using techniques that are 
well known in the art. Shorter probes of, for example, 12 bases can be generated 

5 synthetically and labeled with 32p using T4 polynucleotide kinase. Longer probes of 
about 75 bases to less than 1.5 kb are preferably generated by, for example, PCR 
amplification in the presence of labeled precursors such as [a-^^pjdCTP, digoxigenin- 
dUTP, or biolin-dATP. Probes of more than 1.5 kb are generally most easily amplified 
by transfecling a cell with a plasmid containing the relevant probe, growing the 

10 transfected cell into large quantities, and purifying the relevant sequence from the 
transfecled cells. (See Sambrook et aL, supra,) 

Probes can be labeled by a variety of markers, including for example, 
radioactive markers, fluorescent markers, enzymatic markers, and chromogenic 
markers. The use of ^^P is particularly preferred for marking or labeling a particular 

1 5 probe. 

It is a feature of this aspect of the invention that the probes can be 
utilized to detect the presence of AD4 mRNA or DN A within a sample. However, if 
the nucleic acid is present in only a limited number, then it may be beneficial to amplifS* 
the relevant sequence so that it may be more readily delected or obtained. 

20 A variety of methods may be utilized in order to amplify a selected 

sequence, including, for example, RNA amplification {see Lizardi el al., 
Bio/Technology 6:\ 197-1202, 1988; Kramer et al.. Nature ijy:40M02, 1989; Lomcli 
etai., Clinical Chem. J3(9):1826-I83l, 1989; U.S. Patent No. 4,786,600), and DNA 
amplification utilizing LCR or Polymerase Chain Reaction TPCR") (see, U.S. Patent 

25 Nos. 4,683,195, 4,683,202, and 4,800,159) {see also U.S. Patent Nos. 4,876,187 and 
5,01 1,769, which describe an alternative detection/amplification system comprising the 
use of scissile linkages), or other nucleic acid amplification procedures that are well 
within the level of ordinary skill in the art. With respect to PCR, for example, the 
method may be modified as known in the art. Transcriptional enhancement of PCR 

30 may be accomplished by incorporation of bacteriophage T7 RNA polymerase promoter 
sequences in one of the primary oligonucleotides, and immunoenzymatic detection of 
the products from the enhanced emitter may be effected using anti-RNA:DNA 
' antibodies (Blais, Appl Environ. Microbiol. 60:348-352, 1994). PCR may also be used 
in combination with reverse dot-blot hybridization (lida et al., FEMS Microbiol Lett. 

35 1 7^:167-172, 1993). PCR products may be quantitatively analyzed by incorporation of 
dUTP (Duplaaetal., Anal Biochem, 2/2:229-236. 1993), and samples may be filter 
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sampled for PCR-gene probe detection (Bejelal., Appi Environ. Microbiol. 
57:3529-3534, 1991). 

Within a particularly preferred embodiment, PGR amplification is 
utilized to detect AD4 DNA. Briefly, as described in greater detail below, a DNA 
5 sample is denatured at 95X in order to generate single-stranded DNA. Specific 
primers are then annealed to the single-stranded DNA at 37**C to l{fC^ depending on 
the proportion of A17GC in the primers. The primers are extended at 72*C with Tag 
DNA polymerase in order to generate the opposite strand to the template. These steps 
constitute one cycle, which may be repeated in order to amplify the selected sequence. 

10 Within an alternative prefenred embodiment, LCR amplification is 

utilized for amplification. LCR primers arc synthesized such that the 5* base of the 
upstream primer is capable of hybridizing to a unique base pair in a desired gene to 
specifically detect an AD4 gene. 

Within another preferred embodiment, the probes arc used in an 

1 5 automated, non-isotopic strategy wherein target nucleic acid sequences arc amplified by 
PGR, and then desired products are determined by a colon metric oligonucleotide 
ligation assay (OLA) (Nickerson et ah, Proc. Natl. Acad Sci. USA ^7:8923-8927, 
1990). 

Primers for the amplification of a selected sequence should be selected 

20 from sequences that are highly specific to ADA (an not AD3) and form stable duplexes 
with the target sequence. The primers should also be non-complementan\ especially at 
the 3' end, should not form dimers with themselves or other primers, and should not 
form secondary structures or duplexes with other regions of DNA. In general, primers 
of about 18 to 20 nucleotides are preferred, and can be easily synthesized using 

25 techniques well known in the art. PGR products, and other nucleic acid amplification 
products, may be quantitated using techniques known in the art (Duplaa et al., Anai 
Biochem. 212:229-236, 1993; Higuchi et al., Bio/Technology 77:1026-1030). 

Within a particularly prefened embodiment, restriction enzyme analysis 
is used to detect a mutant AD4 gene. For example, within One embodiment the N 1 4 1 1 

30 mutation in AD4 creates a 5aw3A I restriction site. cDNA, genomic DNA, or amplified 
AD4 cDNA or genomic DNA is restricted with Sau3A 1. If the AD4 gene contains the 
N14il alteration, the Sau3A I fragment or amplified product containing codon 141 will 
be cleaved into two smaller fragments. The cleavage is readily detected by 
elecu-ophoresis in agarose or acrylamide gels and visualized by ethidium bromide 

35 fluorescence or probe hybridization using AD4 gene fragments or oligonucleotides. 
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B. Antibody-based Diagnostic Tests 

Still another aspect of the present invention provides antibodies, as 
discussed above, for the detection of AD4 gene products in diagnostic tests. A variety 
of assays can be utilized in order to detect antibodies that specifically bind to the 
5 desired protein or peptide. Exemplary assays are described in detail in Antibodies: A 
Laboratory Manual, Harlow and Lane (eds.). Cold Spring Harbor Laboratory Press, 
1988. Representative examples of such assays include: countercurrenl immuno- 
electrophoresis (CIEP), radioinununoassays, radio immunoprccipilations, enzyme- 
linked immuno-sorbent assays (ELISA), dot blol assays, inliihition or competition 

10 assays, and sandwich assays, immunostick (dipstick) assays, simultaneous 
immunoassays, immunochromatographic assays, immunofiltration assays, latex bead 
agglutination assays, immunofluorescent assays, biosensor assays, and low-light 
detection assays {see U.S. Patent Nos. 4,376,1 10 and 4,486,530; see also Antibodies: A 
Laboratory Manual supra). 

15 A fluorescent antibody test (FA-test) uses a fluorescently-labelcd 

antibody able to bind to one of the proteins of the invention. For detection, visual 
determinations are made by a technician using fluorescence microscopy, yielding a 
qualitative result. In a preferred embodiment, this assay is used for the examination of 
tissue samples and histological sections. 

20 in latex bead agglutination assays, antibodies to one or more of the 

proteins of the present invention are conjugated to latex beads. The antibodies 
conjugated to the latex beads are then contacted with a sample under conditions 
permining antibodies to bind to desired proteins in the sample, if any. The results are 
then read visually, yielding a qualitative result. In a preferred embodiment, this format 

25 can be used in the field for on-site testing- 
Enzyme immunoassays (EIA) include a number of different assays able 
to utilize the antibodies provided by the present invention. For example, a 
heterogeneous indirect EIA uses a solid phase coupled with an antibody of the 
invention and an affinity purified, anti-IgG immunoglobulin preparation. Preferably, 

30 the solid phase is a polystyrene microliter plate. The antibodies and immunoglobulin 
preparation are then contacted with the sample under conditions permitting antibody 
binding, which conditions are well known in the art. The results of such an assay can 
be read visually, but are preferably read using a spectrophotometer, such as an ELISA 
plate reader, to yield a quantitative result. 

35 An alternative solid phase EIA format includes a plastic-coated ferrous 

metal beads able to be moved during the procedures of the assay by means of a magnet. 
Yet another alternative is a low-light detection immunoassay format. In this highly 
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sensitive formal, the light emission produced by appropriately labeled bound antibodies 
are quantitated automatically. Preferably, the reaction is performed using microtiter 
plates. 

In a capture-antibody sandwich enzyme assay, the desired protein is 
5 bound between an antibody attached to a solid phase, preferably a polystyrene 
microtiter plate, and a labeled antibody. Preferably, the results are measured using a 
spectrophotometer, such as an ELISA plate reader. 

In an alternative embodiment, a radioactive tracer is substituted for the 
enzyme mediated detection in an EI A to produce a radioimmunoassay (RIA). 

10 In a sequential assay format, reagents are allowed to incubate with the 

capture antibody in a step wise fashion. The test sample is first incubated with the 
capture antibody. Following a wash step, an incubation with the labeled antibody 
occurs, in a simultaneous assay, tiie two incubation periods described in the sequential 
assay arc combined. This eliminates one incubation period plus a wash step. 

15 A dipstick/immunostick format is essentially an immunoassay except 

that the solid phase, instead of being a polystyrene microtiter plate, is a polystyrene 
paddle or dipstick. Reagents are the same and the formal can either be simultaneous or 
sequential. 

In a chromatographic strip test format, a capture antibody and a labeled 

20 antibody are dried onto a chromatographic strip, which is typically nitrocellulose or 
nylon of high porosity bonded to cellulose acetate. The capture antibody is usually 
spray dried as a line at one end of the strip. At this end there is an absorbent material 
that is in contact with the strip. At the other end of the strip the labeled antibody is 
deposited in a manner that prevents it from being absorbed into the membrane. 

25 Usually, the label attached to the antibody is a latex bead or colloidal gold. The assay 
may be initiated by applying the sample immediately in front of the labeled antibody. 

Immunofiltration/immunoconcentration formats combine a large solid 
phase surface with directional flow of sample/reagents, which concentrates and 
accelerates the binding of antigen to antibody. In a preferred format, the test sample is 

30 preincubated with a labeled antibody then applied to a solid phase such as fiber filters 
or nitrocellulose membranes or the like. The solid phase can also be precoaled with 
latex or glass beads coated with capture antibody. Detection of analyte is the same as 
standard immunoassay. The flow of sample/reagents can be modulated by either 
vacuum or the wicking action of an underlying absorbent material. 

35 A threshold biosensor assay is a sensitive, instrumented assay amenable 

to screening large number of samples at low cost. In one embodiment, such an assay 
comprises the use of light addressable potentiometric sensors wherein the reaction 
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involves the detection of a pH change due to binding of the desired protein by capture 
antibodies, bridging antibodies and urease-conjugated antibodies. Upon binding, a pH 
change is effected that is nieasurable by translation into electrical potential (jivolts). 
The assay typically occurs in a very small reaction volume, and is very sensitive. 
5 Moreover, the reported detection limit of the assay is 1,000 molecules of urease per 
minute. 

C. Other Assays 

Transmembrane receptors arc involved in many cellular communication 

10 process and have been the targets of numerous pharmacologic screening assays for the 
identification and development of new therapeutic agents. Many of these screening 
assays look for ligand induced changes in cell lines expressing the recombinant 
receptor. In some cases second messengers are assayed directly while in others, 
receptor is transfected into a cell line carrying a reporter gene construct whose 

15 expression level can be influenced (positively or negatively) by Amctional activation of 
the receptor. One common result of the stimulation of many diflerent second 
messenger systems is transient changes in intracellular calcium homeostasis, this can 
be the result of Ca-M- release from various intracellular compartments or from the influx 
of extracellular calcium. 

20 Calcium transients offer a highly sensitive and selective method for 

characterization of AD4 gene function. Expression of recombinant AD4 in cell lines 
previously transfected with an aequorin reporter construct can be used to screen for and 
identify an AD4 ligand. Aequorin is a 21 kDa photoproiein that upon Ca^ + binding 
undergoes an irreversible reaction with the production of light in the visible range. 

25 Because the fractional rate of aequorin consumption is proportional in the physiological 
fCa++], it has been used for many years as a sensitive indicator of intracellular calcium. 
More recently, several different aequorin cDNA's have been engineered which allow 
selective targetting of aequorin expression to different intracellular compartments, 
including the cytoplasm, the nucleus and the endoplasmic reticulum. This allows for a 

30 variety of second messenger coupled pathways/compartments to be screened. 

Threefore, within one embodiment a cell line which expresses mutant 
AD4 may be utilized in order to identify compounds v^ich modify the mutant protein 
function in a way that mimics wild-type AD4 activity. 

35 Labels 

AD4 proteins, nucleic acid molecules which encodes such proteins, anti- 
AD4 protein antibodies and agonists or antagonists, as described above and below, may 
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be labeled with a variety of molecules, including for example, fluorescent molecules, 
toxins, and radionuclides. Representative examples of fluorescent molecules include 
fluorescein, /^/ryco^///. proteins, such as phycoerythrin. hodamine, Texas red and 
luciferase. Representative examples of toxins include ricin, abrin diphtheria toxin, 
5 cholera toxin, gelonin, pokeweed antiviral protein, tritin, Shigella toxin, and 
Pseudomonas exotoxin A. Representative examples of radionuclides include Cu-64, 
Ga-67, Ga-68, Zr-89, Ru-97, Tc-99m, Rh-105, Pd-109, In-1 1 1, 1-123, M25. 1-J31, Re- 
186, Re-188, Au-198, Au-199, Pb-203. At-2I1, Pb.212 and Bi-212. In addition, the 
antibodies described above may also be labeled or conjugated to one partner of a ligand 
10 binding pair. Representative examples include avidin-biotin, and riboflavin-riboflavin 
binding protein. 

Methods for conjugating or labeling the AD4 proteins, nucleic acid 
molecules vtliich encodes such proteins, anti-AD4 protein antibodies and agonists or 
antagonists, as discussed above, with the representative labels set forth above may be 

15 readily accomplished by one of ordinary skill in the art (see Trichothecene Antibody 
Conjugate, U.S. Patent No. 4,744,981,; Antibody Conjugate, U.S. Patent No. 
5,106,951; Fluorogenic Materials and Labeling Techniques, U.S. Patent No. 4,018,884; 
Metal Radionuclide Labeled Proteins for Diagnosis and Therapy, U.S. Patent No. 
4,897,255; and Metal Radionuclide Chelating Compounds for Improved Chelation 

20 Kinetics, U.S. Patent No. 4.988,496; see also Inman, Methods In Enzymolofiv, Vol. 34, 
Affinity Techniques, Enzyme Purification: Part B, Jakoby and Wilchek (eds.). 
Academic Press, New York, p. 30, 1974; see also Wilchek and Bayer, "The Avidin- 
Biolin Complex in Bioanalytical Applications," Anai Biochem. / 7/: 1-32, 1988). 

25 Pharmaceutical Composttions 

As noted above, the present invention also provides a variety of 
pharmaceutical compositions, comprising one of the above-described AD4 proteins, 
nucleic acid molecules, vectors, antibodies, host cells, agonists or antagonists, along 
with a pharmaceutically or physiologically acceptable carrier, excipients or diluents. 

30 Generally, such carriers should be nontoxic to recipients at the dosages and 
concentrations employed. Ordinarily, the preparation of such compositions entails 
combining the therapeutic agent with buffers, antioxidants such as ascorbic acid, low 
molecular weight (less than about 10 residues) polypeptides, proteins, amino acids, 
carbohydrates including glucose, sucrose or dexirins, chelating agents such as EDTA, 

35 glutathione and other stabilizers and excipients. Neutral buffered saline or saline mixed 
with nonspecific serum albumin are exemplary appropriate diluents. 
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In addition, the pharmaceutical compositions of the present invention 
may be prepared for administration by a variety of different routes, although 
intracranial routes arc typically preferred. In addition, pharmaceutical compositions of 
the present invention may be placed w\h\n containers, along with packaging material 
5 which provides instructions regarding the use of such pharmaceutical compositions. 
Generally, such instructions will include a tangible expression describing the reagent 
concentration, as well as within certain embodiments, relative amounts of excipient 
ingredients or diluents (e.g., water, saline or PBS) which may be necessary to 
rca^nstilute the pharmaceutical composition. 

10 

Methods of Treating or Preventing Alzheimrr Disease 
The present invention also provides methods for treating or preventing 
Alzheimer Disease, comprising the step of administering to a patient a vector (e^g.. 
expression vector, viral vector, or viral particle containing a vector) or nucleic acid 
15 molecules alone, as described above, thereby reducing the likelihood or delaying the 
onset of Alzlieimer's Disease. 

Similarly, therapeutic peptides, peptidomimetics, or small molecules 
may be used to delay onset of Alzheimer's disease, lessen symptoms, or halt or delay 
progression of the disease. Such therapeutics may be tested in a transgenic animal 
20 model that expresses mutant protein, wild-type and mutant protein, or in an in vitro 
assay system. 

One such in vitro assay system measures the amount of amyloid protein 
produced. Briefly, by way of illustration, a cell expressing both AD4 gene product and 
amyloid is cultured in the presence of a candidate therapeutic molecule. The AD4 

25 protein expressed by the cell may be either wild- type or mutant protein. In either case, 
the amount of amyloid protein that is produced is measured from cells incubated with 
or without (control) the candidate therapeutic. Briefly, by way of example, cells arc 
labeled in medium containing ^^S-methionine and incubated in the presence (or 
absence) of candidate therapeutic. Amyloid protein is detected in the culture 

30 supematent by immunoprecipitation and SDS-PAGE electrophoresis or by ELISA. A 
statistically significant reduction of amyloid protein compared to the control signifies a 
therapeutic suitable for use in preventing or treating Alzheimer's disease. 

' Alternatively, transgenic animals expressing Alzheimer's disease protein 
may be used to test candidate therapeutics. Amyloid protein is measured or, if the 

35 animals exhibit other disease symptoms, such as memory or learning deprivation, an 
increase in memory or learning is measured. Memory and learning are tested in rodents 
by the Morris water maze (Stewart and Morris in Behavioral Neurosciencc, R. Saghal, 
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Ed. (iRLPress, 1993, p. 107) and the Y-maze (Bnis ct al. Brain Res. Bull. 6:7], 1981). 
Therapeutics are administered to animals prior to testing. The response time in trials 
are measured and an improvement in memory and learning is demonstrated by a 
statistically significant decrease in the timed trials. 
^ As noted above, the present invention provides methods for treating or 

preventing Alzheimer's disease through the administration to a patient of a 
therapeutically effective amount of an antagonist or pharmaceutical composition as 
described herein. Such patients may be identified through clinical diagnosis based on 
symptoms of dementia or ieaniing and memory loss which are not attributable to otlier 

10 causes. In addition, patients are also identified through diagnosis of brain atrophy as 
determined by magnetic resonance imaging. 

Cognitive behavior in AD may be measured by any one of several tests 
(See Gershon et al.. Clinical Evaluation of Psychotropic Drugs: Principles and 
Guidelines. Prien and Robinson (eds.). Raven Press, Ltd.. New York, 1994, p. 467). 

15 One such test, BCRS, is designed to measure only cognitive functions: concentration, 
recent memory, past memory, orientation, functioning, and self-care. This test, as well 
as the Weschler Memory Scale and the Alzheimer's Disease-Associated Scale, may be 
used to determine improvement following therapeutic treatment. "Improvement" in 
Alzheimer's disease is present if there is a statistically significant difference in the 

20 direction of normality in the Weschler Memor>' Scale test. For example, test results of 
the performance of treated patients as are compared to members of the placebo group or 
between subsequent tests given to the same patient. Improvement within the present 
invention also encompasses a delay in the age of onset of Alzheimer's disease. 

As will be evident to one of skill in the art, the amount and frequency of 

25 administratioi; will depend, of course, on such factors as the nature and severity of the 
indication being treated, the desired response, the condition of the patienu and so forth. 
Typically, the compositions may be administered by a variety of techniques, although 
intra-cronial routes are of^en preferred. 

Within other embodiments of the invention, the. vectors which contain or 
.30 express the nucleic acid molecules which encode the AD4 proteins described above, or 
even the nucleic acid molecules themselves may be administered by a variety of 
alternative techniques, including for example administration of asiaioosomucoid 
(ASOR) conjugated with poly (L-lysine) DNA complexes (Cristano et at., PNAS 
92122-92126, 1993), DNA linked to killed adenovirus (Curiel et aL, Hum. Gene Tker 

35 5(2): 1 47- 154, 1992), cylofectin-mediated introduction (DMRIE-DOPE, Vical, Calif.), 
direct DNA injection (Acsadi el al.. Nature i52:815-818, 1991); DNA ligand (Wu et 
al., J. of Biol Chem. 25^:16985-16987, 1989); lipofeciion (Feigner et al., Proc. Natl. 



SNSOOCtO: <WO 97Qai92A3..IA> 




wo 97/03192 



PCT/US96/n3»6 



35 



Acad ScL USA *-/:74 13-74 17, 1989); liposomes (Pickering et aL, Qrc. ^9(1):13.2K 
1994; and Wang et al., PNAS <!^4:785 1-7855, 1987); microprojectile bombardment 
(Williams el al.. PNAS 882126-2120, 1991); and direct delivery of nucleic acids which 
encode the AD4 protein itself either alone (Vile and Hart, C-anccr Res. 53: 3860-3864, 
5 1 993), or utilizing PEG-nucleic acid complexes. 



The following examples are offered by way of illustration, and not by 



way of limitation. 
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EXAMPLES 
EXAMPLE 1 

Genetic Mapping of the AD locus from the Volga German Kindred 

5 

The Volga German kindreds are a group of 7 related families with 
autosomal dominant early-onset Alzheimer's disease (AD). These families come from a 
group of Germans, who, in the 1760's, emmigrated to the Volga river region of Russia. 
The VG remained culturally distinct from the surrounding Russian population and has 

10 apparently been genetically relatively isolated until this century. In 1987, Bird and 
collegues (Bird, et al., Ann. Neurol. 2i:25-31, 1988) identified 5 VG kindreds with 
early-onset AD. Subsequently 2 additional early-onset VG families were identified 
(Bird, et al., Ann. Neurol, 25:12-25, 1989). in addition, other families with early-onset 
disease have been identified in which, similarly to the VG kindred, the AFP gene and 

15 the chromosome 14 locus have been excluded as harboring the causative gene, 'fhese 
families include 4 Swedish families with a mean agc-of-onset ranging from 55 to 64 
and 2 Dutch families with mean onsets of 52 and 59 years. The 7 VG families are 
dcscendenls of inhabitants of 3 adjacent villages from the Volga region of Russia. 
Although a common ancestor could not be traced, the unique ethnic origin coupled with 

20 150 years of relative genetic isolation suggests that AD in these families is likely the 
result of a common genetic founder. Inheritance of AD in these kindreds appears to be 
autosomal dominant. Onset of AD in these families ranges from 40 to 81 years with 
family mean age-of-onsets ranging from 51 to 65 years (Bird et al., supra. Table I). 
Numerous affected subjects in these families have been extensively characterized, both 

25 clinically and neuropathologically, and at least ! affected subject from each family has 
had autopsy confirmation of the diagnosis of AD. Except for the relatively early age- 
of-onset, AD in the VG is clinically and pathologically indistinguishable from typical 
AD, Neurofibrillary tangles, Ap neuritic plaques, and other changes associated with 
AD have been observed in multiple autopsies. 
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Table 1. VG kindreds used for linkage analysis. 





Family 


AfTcctcd 


Autopstcd 


AlTectcds 


Total 


Mean age of onset 


5 




(n) 


(n) 


sampled 


subjects 


+ SO. (n). range 




H 


8 


3 


4(2) 


4 


59.5 + 3.9 (6) 56-68 




HB 


23 


5t 


6<2) 


30 


60.8 + 7.1 (22) 54-75 




HD 


19 


2 


6 


20 


59.6+ 10.3 (17) 46-82 




KS 


14 


3 


8(1) 


29 


64.8 + 5.4 (13) 55-71 


10 


R 


20 


5 


9(3) 


37 


50.2 7.3 (17) 40-67 




W 


4 


2 


2 


4 


59-2 + 10.5 (4) 48-79 




WFL 


6 


2 


2 


15 


63.8 + 7.6 (6) 55-76 




ToUls 


94 


22 


37(8) 


139 


59.1 ^ 9.1 (86) 40-82 



15 * Includes I unaffected subject autopsicd 

^ Numbers in parentheses indicates DNA samples obtained from autopsy materiaL either as paraffin 
blocks or frozen brain. 



The locus responsible for AD in the VG kindreds has not been 

20 previously mapped genetically. Linkage analysis, as well as mutational analysis, of the 
VG families with chromosome 21 and chromosome 14 markers (AD3 locus and APOE 
locus) has yielded negative results. Moreover, the locas responsible for AD in the VG 
kindreds does not cosegregate with markers at the APP locus, and no mutations have 
been detected in the APP gene. 

25 Linkage analysis was used to scan the genome for the locus causing AD 

in these VG families. DNA was prepared from lymphoblastoid cell lines from 139 
individuals including 37 affected subjects. When suggestive evidence for linkage was 
found, autopsy-derived tissue, either frozen or embedded in paraffin, was used to 
prepare DNA from 8 additional affected subjects for whom no other tissue was 

30 available. Markers on all chromosomes were genotyped and analyzed using the lod 
score method (Morton, Am, 1 Hum. Genet. 7:277, 1955; Hodge, et al„ Am. J. Htm. 
GeneL 35\\ 139, 1983). For the genome screen, evidence for linkage was evaluated 
assuming autosomal dominant inheritance with age-dependent penetrance and a 0% 
sporadic rate. Lod scores were also computed using a low (1%) penetrance model, 

35 which makes no assumption about the disease status of at-risk individuals and thus 
serves as a check that information about linkage derives primarily from the affected 
individuals (in whom the genotype at the disease locus is more accurately known 
compared to that of at-risk subjects). Published marker allele frequencies (Genome 
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Database) were used unless critical allele frequencies were significantly lower than 
those estimated in the VG. In this case, frequencies obtained from the VG pedigrees 
using all subjects (affecteds, at-risk and spouses) were used. This conservative 
approach was taken during screening since underestimation of the frequency of an 
5 allele co-segrcgating with the disease can cause false positive evidence for linkage (On, 
Am. J. Hum Genet. J/:283, 1992). However, this approach will underestimate lod 
scores if the true allele frequency is lower than assumed. In this study. 162 publically 
available markers were genotyped. Among these were 70 STRP markers with 
heterozygosity values mostly > 0.70 spaced > 20 cM apart. 

10 This work demonstrates the presence of a previously unmapped AD 

locus on chromosome lq31-42. 

Initial suggestive lod scores for chromosome 1 were obtained for 
markers D1SI03 (Z„„ = 1.79, 9 = .10) and D1S249 (Z^, = 1.76, 0 = .20) (Table 2), 
which are separated by approximately 26 cM (Fig. 1 ). Subsequently, 21 other markers 

15 in this region spanning 55 cM were analyzed (Table 2). Using the conscr\'ative 
screening analysis conditions, significant evidence for linkage (Z^„ > 3 0) was obtained 
with D1S479 {Z„„ = 4.40, G = .11). This lod score was similar to that predicted by 
earlier power analyses. Most of the evidence for linkage comes from families HB and 
Ry with positive lod scores also observed for families W. H, and HD (Table 3). 

20 Subsequently, a 112 bp allele of DIS479 co-segregated with the disease in 5 of 7 
families, which is consistent with a common genetic founder. Other markers in the 
region also gave positive though non-significant lod scores (e.g. D1S439, 2^^ = 2.82, 0 
= ,17; DIS320, - 1,87, e = .14; DIS103, Z„„ = 2.40, 8 = .008; Table 2). 

Table 2 demonstrates that FAD is linked to chromosome 1 markers. 

25 Genotypes were perfomied by conventional nnethods, such as by PGR. All genotypes 
for D1S479 were determined in duplicate. For markers D1S227, D1S320, 01S213, 
D1S439, DIS479, D1S225, and DISI03, genotypes were determined for 8 samples 
fi-om affected subjects either derived from either paraffin blocks or frozen brain 
material (not all samples amplified with all markers). Lod scores were computed using 

30 the computer program LIPED (Otl, Am. J, Hum. Genet.. 26:588, 1974) assuming age- 
dependent penetrance (Morion, J. Hum. Genet., 7:211 y 1955). 
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Table 2. 

Lod scores for linkage of FAD to chromosome I markers* 



Recombination fraction (9) 



Locus 


Het.' 


0.001 


0.05 


0.10 


0.15 


0.20 


0.30 


0.40 


DIS238 


0.86 


-9.29 


-1.99 


-0.34 


0.42 


0.76 


0.81 


0.43 


D1S422 


0.76 


-7.25 


-1.70 


-0.28 


0.38 


0.67 


0.65 


0.27 


DIS412 


0.71 


-7.82 


•1.66 


0.18 


1.00 


1.31 


1.17 


0.54 


DIS306 


0.61 


-14.71 


-7.75 


^.69 


-2.88 


-1.71 


-0.43 


0.01 


OIS3I0 


0.57 


-6.03 


-1.75 


-0.79 


-0.31 


-0.06 


0.14 


0.11 


DIS249 


0.87 


-9.76 


-1.12 


0.65 


1.44 


1.76 


1.60 


0.81 


D1S245 


0.82 


-12.15 


-2.56 


-0.42 


0.58 


1.05 


1.11 


0.55 


D1S205 


0.80 


-11.66 


-3J9 


-1.45 


-0.48 


0.04 


0.38 


0.20 


D1S425 


0.81 


-8.36 


-2.38 


-1.02 


-0.32 


0.05 


0.27 


0.15 


D1S217 


0.66 


-3.87 


•0.97 


-0.27 


0.10 


0.30 


0.32 


0.11 


D1S237 


0.76 


-4.77 


-1.88 


-0.76 


-0.13 


0.20 


0.36 


0.20 


D1S229 


0.77 


-6.54 


-1.42 


-0.34 


0.12 


0.28 


0.18 


-0.08 


D1S227 


0.67 


-2.23 


0.23 


0.80 


0.92 


0.83 


0.34 


-0.07 


D1S320 


0.70 


-8.99 


-1.23 


0.36 


1.03 


1.27 


1.09 


0.42 


D1S213 


0.86 


-3.80 


-1.57 


-0.80 


-0.37 


-0.13 


0.05 


0.04 


D1S479 


0.80 


-0.05 


3.91 


4.39 


425 


3.81 


2.50 


0.99 


DIS439 


0.80 


-5.59 


1.05 


2.36 


2.78 


2.76 


1.97 


0.73 


DIS251 


0.82 


-15.43 


-5.19 


-2.67 


-1.31 


-0.51 


0.14 


0.14 


DIS225 


0.79 


-10.44 


-2.62 


-0.48 


0.60 


1.12 


1.16 


0.52 


DIS103 


0.80 


0.33 


1.60 


1.79 


1.74 


1.57 


1.05 


0.46 


D1S459 


0.79 


-20.40 


-5.86 


-2.87 


-1.39 


-0.59 


0.02 


0.07 


D1S446 


0.69 


-12.37 


-4.50 


-1.99 


-0.71 


-0.02 


0.46 


0.32 


DIS235 


0.68 


-7.54 


-1.09 


0.27 


0.85 


1.06 


0.88 


0.34 



Lod scores were computed using published allele frequencies except for D1S479 For 
this marker, the allele frequency for the 1 12 bp allele was derived from the VG subjects 
as a group, while frequencies for the other alleles were from published values f 1 7) 
* Het, heterozygosity. ' v /- 
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In Table 3, the family lod score for likage of AD to the marker D1S479 
is presented. Conditions I and 2 are age-dependent penetrance with VG and CEPM 
allele frequencies respectively; conditions 3 and 4 are age-dcpendcnt penetrance with a 
sporadic correction function using VG and CEPH allele frequencies, respectively. For 
5 the H, HB, and R families, maximum lod score were obtained with DI S479. For other 
families, maximum lod values were obtained with other markers (HD; DIS103, Z^,, = 
1.75, 9 - 0.001. condition 4), DIS249 (KS; DIS4I2, Z„„ = 1.20, G = .10, condition 2), 
and Dl S439/DIS227 (W; DIS227 and D1S439, = 0.78, 9 = .001, condition 4). No 
marker was positive for the WFL family under any condition. Under the low (1%) 
1 0 penetrance model, Z^ values were 3.04 (0 = . 1 0) and 1 .27 (9 - . 1 5) for the control and 
VG allele frequencies, respectively. 
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Table 3 

Family lod score for linkage of AD to D1S479. 



pBinily 


Condition 


0.001 


0.05 


n in 


n 1 

U. 1 ^ 


A TA 


A ^A 

U.30 


A ^A 


u 
n 




0.45 


V.JD 


A TO 

VJM 


Alt 


0.15 


0.06 


0.01 




2 


0.44 


ft ij; 




ATI 


0.15 


0.06 


0.01 




3 


0.90 


n 7K 
u. /o 


ft AH 


ft <A 


0.42 


0.21 


0.06 




4 


0.90 


V./o 


U.DO 


ft <A 


0.42 


0.21 


0.05 


no 




1.67 


1 79 

l./o 


1 7 1 
l./l 




1.33 


0.80 


0.26 




2 


1 Mi 

l*VV 


I.// 


1 £j4 
I.M 


I Ay 


1.27 


0.76 


0.25 




3 






13/ 




1.93 


l.3i 


0.59 




4 


C*j 1 




2.30 


Til 

2.12 


1,87 


1.27 


0.57 




] 
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In the VG pedigrees, genotypes are not available from many deceased 
individuals who are related to sampled subjects (e.g. all individuals in generations I-III 
in Fig. 2). Because of this missing data problem, marker allele frequencies can 
influence linkage analysis results. For most of the markers used, allele frequencies in 
5 the VG families were similar to those from controls. However, D1S479, in 5 of the 
families (H, HB, HD, R, and W), was substantially more frequent in the affected 
subjects (0.32 in affected subjects, 0.18 in all VG subjects,) compared to a frequency of 
0.04 in controls. Such an increase in frequency for a closely linked marker allele is 
expected. When control frequencies based on VG unaffected spouses were used for the 

10 1 12 bp D1S479 allele, the maximum lod score for D1S479 increased from Z^^ = 4.40 
(9 - .11) to Z^, = 6.29 (6 = .10) (Table 3). 

Linkage analysis may be affected by the presence in the pedigrees of 
phenocopies (non-AD dementia cases or AD caused by something other than the major 
gene segregating in these families). Phenocopies can reduce the power to detect 

15 linkage and increase the estimated G (Cavaili-Sforza and King, Am. J. Hum. Genet, 
38:599, 1986), Phenocopies are a potential problem in the VG kindreds since the range 
of onset ages for AD in these families extends up to 82 years of age (Table 1 ), which 
overlaps with common lale-onset AD. To correct for the presence of late-onset AD 
cases in the families, a phenocopy correction model was used (Schellcnberg. et ah. Am 

20 J. Hum. Genet. 5J:619, 1993), which assumes that as the age-of-onsel of a subject 
increases, the probability that the case is a phenocopy is increased. Despite the fact that 
in the absence of true phenocopies this model reduces the power to detect linkage, for 
all markers analyzed, the lod scores increased (Margaritte.et al.. Am. J. Hum, Genet. 
JO: 1231, 1992). For D1S479, values increased from 6.29 to 6.51 and from 4.40 to 

25 4.60 using the control and VG allele frequency for the 112 bp allele, respectively. 
Under this model, the maximum D1S479 lod score for a single family for D1S479 was 
2,95 (6 = .09) for the R family. These results support the hypothesis that at least some 
of the AD cases in these families are probably phenocopies. 

Hapiotypes were constructed for the R family using the 23 markers 

30 spanning approximately 55 cM of chromosome 1 (Fig. 7). A common haplotype 
between D1S439/DIS479 and D1S306 was observed in all but 1 of the affected 
subjects (IV- 1, Fig. 7). This individual did not share the disease haplotype across the 
entire region, had an age-of-onset of 67 years, which is greater than 2 SD above the 
family mean, and had an E4/e4 genotype at the APOE locus. This APOE genotype is 

35 associated with lower age-of-onset for late-onset AD (Corder, et al., Science 267:921, 
1993). Thus, this subject may be a phenocopy. In other families, definite assignment 
of a common disease haplotype was difficult because of missing data, and will require 
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extensive multipoint analysis. However, for DIS479, the 112 bp allele segregated with 
the disease in the H, HB. HD, W» and R pedigrees. In the HD family, the 1 affected 
subject who did not have the 1 12 bp allele, had an age-of-onset of 75 years and was t:3/ 
e4 al the APOE locus. However, age-of-onset alone could not be used to 
5 unambiguously determine whether a subject is a phenocopy. In the HB family, subjects 
with onsets of up to 75 years shared the D1S479 112 allele with other younger affected 
subjects. The KS family is the only VG kindred in which no affected subject had the 
DIS479 1 12 bp allele. Analysis of this family is complicated because of the late age- 
of-onset (64.8 years) and because of the 8 affected subjects, 3 were APOE t4/i;4 

10 homozygotes (onset ages of 57, 58. and 67 years) and 4 were g4 heterozygotes (onset 
ages 67, 68, 68 and 71 years). Thus, the KS family could be an example of genetic 
heterogeneity within the VG kindreds, although group heterogeneity tests did not detect 
heterogeneity. In summary, the finding of the same rare allele segregating with AD in 
most of the VG pedigrees supports the hypothesis of a common genetic founder for 

1 5 most of the families. As described herein, the sequence data from VG patients confirms 
this hypothesis. 

Taken together, these data demonstrate the presence of an AD locus at 
Iq3l-42. Analysis of recombinants in the R family define the candidate region as 
between DIS225 (subjects VI-3 and V-3 in Fig. 2) and D1S217 (subject IV-16), a 
20 region spaxuiing 14 cM (Fig. 8), 

EXAMPLE 2 

Cloning of the AD4 gene from chromosome 1 using YACS 

25 YAC clones derived from chromosome 1 were obtained from the CEPH-Genethon 
library and were identified for the region spanning D1S229 to DIS103 by 2 different 
methods. 

In the first method, the CEPH-Genethon YAC library was arrayed for 
PCR-based screening. This publically available library (Bellanne-Chantelot, et al.. Cell 

30 70, 1059-1068, 1992) was obtained from Glen Evans (Salk Institute) as 21,792 clones 
in 227 96-well plates. For DNA preparation, colonies were grown separately in 1.5 ml 
liquid media, harvested, and pooled. Spheroplasis were made by digestion with 
lyticase. DNA was purified by phenol/chlorofomn/isoamyl alcohol extractions and 
precipitated by the addition of sodium acetate and ethanol. After resuspeiision and 

35 RNase treatment, the DNA was ready for PCR amplification. 

A two-dimensional YAC pooling scheme allowed the identification of 
specific YAC addresses by PCR screening (Amemiya, et al., NucL Acids. Res. 
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20:2559-2563, 1992). The library was subdivided into five sets (4 x 48 and 1 x 35 
trays). Duplicates were made of each set. One replica of each of the 5 sets was u.sed 
for the first dimension basic horizontal pools (each pool represents all clones from a 
single 96-well plate). The second replica was used for second dimension basic vertical 
5 pools (pools containing clones from a specific well position such as Al for plates from 
a set). The pools within each set were further combined to facilitate VCR screening. 
For each set, horizontal pools were combined to generated to 4 super-horizontal pools. 
Each super-horizontal pool contained clones from 12 consecutive plates (plates 1-12, 
13-24, 25-36, 37-48). The second dimension was combined to produce 8 super-vertical 

10 pools per set. Each super vertical pool contained clones from each row in a .set (A, B, 
C,...etc.). Hence the entire library was reduced to 19 super horizontal pools (4X4 sets 
+ 3X1 set), and 40 super vertical pools (8X5 sets). In this way, candidate clones can 
be found after two rounds of PGR i.e. a first round with super pools and a second round 
with the basic pools. When multiple hits occur, a final round of PGR is necessarv. 

15 The YAC library was screened using primers for the STS D1S320 and 

D1S479 as shown in Figure 9. This screening yielded the following YAGs: 920e7, 
859b6, 904f3. 

In the second method, a data base was queried. The CEPH-Genelhon 
INFO clone data base contains infomiation concerning the genetic marker and STS 

20 content of YAGs in the GEPH-Gcnethone YAG library. This database was queried 
with the following markers: D1S229, DIS227, D1S213, D1S479, D1S439, D1S225, 
DIS103, D1S495. The YAGs identified were 857h3, 934f2, 957gl0, 881b8, 9l3el0, 
8lOe5, 921dl2, 865c8, 920e7, 859b6, 736b6, 746d7, 748h9, 748e2, 753a5, 753d8, 
753d9. 753f9, 753n2, 753h3, 757d8, 759bl0, 757f9, 76lhl0, 767bl, 767dK 767gl, 

25 769ni. 786a8, 786fll, 820g5, 824g8, 824g9, 824gll, 826b3, 826b5, 83le4, 849a7, 
856a7, 880dl2, 897f9, 899fl, 899hl, 920gl, 904O, 9l5e8, 922al0, 933a8. 958elO, 
978gl0, and 979G. Some of these YAGs contain the marker used in the query and 
some are connected to YAGs which contain the marker used in the database query. A 
total of 60 YAGs were identified by the combined methods. 

30 YAG DNA was prepared fn)m each YAG clone. The YAG DNA was 

then amplified using the primers shown in Figure 9. A YAC contig map partially 
covering the region from D1S229 to DIS103 was constructed. 

EST T03796 (Figure 10), was identified in the EST GenBank database 
as a sequence with homology to the chromosome 14 AD gene clones both at the 

35 nucleotide level and the amino acid level. Oligonucleotide primers, EPIF and EPIR, 
and KM7749 and KM7750 were used for PGR amplification (Figure 1 1). Given the 
EST sequence, the expected size was approximately 140 bp (for primers EPIF and 
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EPIR) or 190 bp (for primers KM7749 and KM7750). When human genomic DNA 
was amplified, a 1,300 bp band was observed. Amplification of the YACs from the 
chromosome I Volga German gene region with the same primers yielded a 2.5 kb 
fragment from four different isolates of VAC 921dl2. One isolate of 921dl2 did not 
5 yield this fragment, but it was smaller and presumably deleted for the 2.5 kb fragment. 
Y AC 92 1 d 1 2 has been previously shown to contain the markers D 1 S439 and D I S479, 

In addition, a panel of rodent-human hybrid cell lines obtained from the 
Coriell Institute for Medical Research were also amplified with the same primers. 
These cell lines contain one or a few human chromosomes on a rodent background 
10 (either mouse or hamster). Only cells containing human chromosome 1 had the 2.5 kb 
band. 

DNA from YAC 92 Id 12 was amplified with primers EP IF and EPIR 
and subsequently with WWF and WWR (Figure 11) to produce a 2,5 kb fragment. 
DNA sequence analysis of this 2.5 kb fragment was performed with primers WWF and 

1 5 WWF 1 L (Figure 2) to give sequence ELL I . I ; (Figure 1 0) this sequence contained at its 
5' end a portion of the EST T03796. This 2.5 kb fragment was also sequenced with 
primers WWR and WWRIR yielding the sequence ELLK2, which also contained a part 
of the DNA sequence from T03796. The 2.5 kb band from YAC 921 dl 2 was 
reamplified, purified and sequenced using the primers: WWF, WWFIL, WWR and 

20 WWRIR. The sequence of these DNA molecules is shown in Figure 10. In this 
fragment, 1 19 bp on the 5' and and 69 bp on the 3* end were identical to the EST 
sequence. In addition, a splice acceptor and donor sequence were observed at both 
junctions of the EST sequence, suggesting that the intervening sequence is an intron of 
the chromosome 1 gene. Thus, EST T03796 is on YAC 921DI2, and is a candidate for 

25 containing at least part of the Volga German AD gene. 

cDNAs encoding the AD gene were isolated as follows. One million 
clones from a Werner syndrome fibroblast cDNA library or B616 human brain cDNA 
library were plated and transferred to Duralon membranes (Stratagene). The filters were 
prc-hybridized for one half hour in Church-Gilbert Hybridization solution (0.5M Na 

30 phosphate, pH7.0. 7% (w/v)SDS, 1% (w/v)BSA) at 65° C and screened using 500,000 
cpm/ml of a 140 base pair PGR product amplified from the EPl primers (140 bp of the 
EST T03796) or a 190 bp PCR product amplified from KM7749 and KM7750 primers. 
Probe was labeled using a random primed method (Doehringer-Mannheim) for half an 
hour and unincorporated nucleotides were removed by a Microspin s-200 HR column 

35 (Pharmacia). The filters were hybridized for 16 hours and then washed 1 5 minutes each 
at 65*^ C in 2X SSC/0.1%SDS followed by a wash in O.IX SSC/0.1% SDS. 
Autoradiographs were exposed for 16 hours and four primary candidates from the 
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fibroblast library and five primaries from the brain library were isolated. The primary 
candidates were positive upon rescreening. Secondary positives were picked into SM 
buffer and rescued using pubhshed protocols (Stralagene). 

In addition, 600,000 clones in an arrayed version (Munroe,et al., Proc, 
5 Natl Acad Sci USA P2:2209-2213, 1995) of the above fibroblast cDNA library were 
screened using PCR methods. The EPl and WW I primer pairs were used to screen the 
plate pools (M8). The EPl primer pair gave positives in plates 1 1 , 1 7 and 36. while the 
WW! primers gave positives in plates 17 and 36. When PCR was done to determine 
column and row addresses only two addresses could be determined: plate 17 C7 (both 

10 primer pairs) and 17 ClI (WW I only). Confirmator>' PCR on the individual wells 
showed product for only \7C\\. Both wells were plated for lertiar>' screening to obtain 
plaque purified phage. The EPl PCR product was used for screening as above; positive 
plaques were isolated for 17 CI 1 and were rescued as above. Positive plaques 
eventually purified were designated IK ISA, 15B, and CI 1, PS2-6, PS2-I0, and PS2- 

15 41. 

The sequence from chromosome 1 was then used to screen several 
cDNA libraries. Positive clones 2.2 kb in length were isolated and sequenced. The 
sequence, shown in Figure K shows clear overiap with the ends of the PCR fragment. 
Additional EST sequences with overlapping sequence, and additional cDNA sequences 
20 were obtained by ftirther library screening. 

EXAMPLE 3 
Mutations in Affected VG patients 

25 Using RT-PCR, (he entire gene has been amplified from both affected 

VG individuals and unaffected individuals. The unaffected individuals are from VG 
families and from unrelated lineages. The sequences obtained have been analyzed. In 
affected individuals, there is a nucleotide change in codon 141 (AAC->ATC) resulting 
in an amino acid alteration from asparagine to isoleucine (N141I). No such change was 

30 found in any imaffected individuals of comparable age. The amino acid sequence in the 
region of the alteration is LLNSVLNTLIMISVIVVMTI in the unaffected and 
LLNSVUTLIMISVIWMTI in the affected individuals examined. Affected subjects 
carrying the N141I mutation were identified in 5 (H, HB, HD, R, W) of the 7 VG 
families initially screened. All subjects with the N1411 mutation also carried the 

35 D1S479 1 12 bp allele (allele H, Fig. 7). Subsequently, 2 additional VG families (E and 
BE) were screened for the N1411 mutation. In both kindreds, the affected subjects 
carried the N141I mutation and the D1S479 1 12 bp allele. In total, the N141I mutation 
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was directly observed in 20 affected subjects and inferred to be present from 
spouse/children genotypes in an additional 6 AD subjecU; onset ages for the 26 subjects 
ranged from 44 to 75 years. The N141I mutation was also observed in 15 at-risk 
subjects (blood relatives of affected N141I carriers) with ages ranging from 23 to 67 
5 years. No VG spouse (n 17) had the N141I mutation. Additional populations 
screened included 84 normal Caucasians, spouses, at-risk subjects and affected subjects ^ 
from 67 late-onset familial FAD families (n = 223 affected subjects), and 48 affected 
sporadic subjects from an AD clinic population. 

In addition to the N 1411 mutation, three other variants were identified at 
10 nucleotide 436 (C-^T, subject HB), 496 (C->T, subjects HD and W) and 628 (C->T, 
subjects HD and W). These variants did not change the amino acid encoded at these 
sites (alanine, asparagine, and hisiidine, respectively). 

EXAMPLE 4 

15 Characterization of the AD4 Gene and its Protein Product 

The sequence of the VG from chromosome I was constructed from 
sequence of several clones as described. The sequence is shown in Figure 1 . The 
amino acid sequence deduced from this DNA sequence is shown in Figure 2. 

20 Comparison of this sequence with the sequence of the AD3 gene from chromosome 14 
shows sequence homology. An alignment of the proteins and genes as constructed 
using the program GAP from the Wisconsin package of sequence analysis software 
(Vcr. 8.0.1 - UNIX, September 1994) is shown in Figure 3. The amino acid sequence 
of AD4 exhibits approximately 67% identity to the AD3 gene product. Some notable 

25 characteristics of the AD4 gene product are shown in Figure 5. 

The protein encoded by the chromosome I AD gene is a membrane 
protein, as judged by the seven hydophobic segments presumed to be membrane 
spaiuiing regions (Figure 4). The AD4 protein is also referred to as STM2. For STM2, 
all the putative transmembrane helices are capped at the carboxyl-terminal end by a 

30 lysine residue, foimd either at the very end or within a few residues of the end of the 
transmembrane domains. The N141I mutation^ a change from a hydrophilic asparagine 
residue to a hydrophobic isoleucine residue, is at a position directly adjacent to the first 
predicted transmembrane domain. 

A northern blot (Clontech) containing 2 ^g of human poly A-plus 

35 MRNA derived from heart brain placenta, lung, liver, skeletal muscle, and pancreas 
was hybridized with the 2.3 kb AD4 cDNA clone as a probe. Hybridization was 
performed in 0.75M NaCl. 0.05M NaH2P04, 5mM Na2EDTA, lOX Denhardfs 
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solution, 100 micrograni/nil salmon sperm DNA, 50% formamide, 2% SDS at 42^C for 
18 hours. Filters were washed with O.IXSSC, 0.1% SDS at 65^C. Northern blot 
analysis of RNA from a variety of human tissues revealed two AD4-related transcripts 
which were approximately 2.4 and 2.8 kb in size. The larger transcript appears to be 
5 primarily expressed in placenta, skeletal muscle and heart, while the smaller transcript 
is detected in heart, brain, placenU, liver, skeletal muscle and kidney but not lung. A 
sequence-tagged site (STS) from the 3* untranslated region was used to confirm that 
AD4 maps to chromosome I and to YAC 921dl2. The two transcripts observed arc 
possibly the result of alternative polyadenylation since two putative polyadenylation 

10 signals were identified in the 3*unUanslated region, at nucleotides 1834 and 2309. The 
message sizes for STM2 from Northern blot analysis (2.4 and 2.8kb) are slightly larger 
than the 2.3 kb size predicted from the full-length DNA sequence and the 1.8 kb size 
predicted by the second polyadenylation site observed in the sequence. Tliis apparent 
discrepancy may be due to the anomalous migration of size markers which has been 

1 5 previously observed with this type of Northern blot. 

The normal cellular function of STM2 i$ unknown. If this protein is 
functionally similar to spe4, the C elegans gene with sequence similarity to STM2, it 
may play a role in the cytoplasmic partitioning of proteins. Mutations in STM2 could 
alter intracellular protein trafficking of APP and ultimately lead to altered APP 
20 processing and increased production of Api-40 or Apl-42 (amyloid p protein). 
Consistent with this notion, recent work with fibroblast cell lines from AD3 carriers has 
shown that secretion of Ap ending at position 42 is increased relative to non-carrier cell 
lines. Alternatively, STM2 may ftinclJon as a G-protein coupled receptor or as an ion 
channel. 

25 

EXAMPLE 5 
Cloning of AD gene on chromosome 1 



The gene may be obtained from genomic or cDNA libraries or from 
30 genomic DNA or mRNA. The nucleotide sequence of the gene provided herein may be 
used to design oligonuclotides. The primers are optimally 1 8-25 nucleotides in length 
and preferably contain not more than 50% AT base pairs, do not end in A. and should 
not have self-complementary regions. 

The oligonucleotides may be used as hybridization probes on cDNA or 
35 genomic libraries. For cDNA libraries, mRNA from a specific source, such as a cell 
line or tissue sample, is transcribed into cDNA and inserted into an appropriate vector 
(e.fi.^ XgilO, XZAP, pBS, or pSPORT). (see Sambrook et al.. Molecular Cloning, Cold 
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Spring Harbor, 1989). The recombinant vector is introduced into an appropriate 
bacterial strain {e.g., DH5a or KW257) and the cells plated on growth medium. 
Alternatively, cDNA libraries representing RNA from various sources are obtained 
commercially (e.g., Clontech Laboratories). In either case, colonies (for plasmids) or 
5 plaques (for phage) are transferred to a nitrocellulose or nylon membrane and prepared 
for hybridization (Sambrook, supra). Oligonucleotide probes are labeled at their 5' end 
with using T4 polynucleotide kinase. Hybridization and washes are performed 
under standard conditions for the length and GC content of the probe (see Sambrook et 
al., supra). 

10 For isolation of genomic clones containing the AD4 gene, genomic 

libraries are constructed by standard methods (Sambrook et al.. supra) or obtained from 
commercial sources (Clontech, Stratagene, Research Genetics and Genome Systems). 
Libraries are constructed in phage, cosmids, PI. BACs, or YACs and probed as 
described for cDNA libraries. Colonies or phages thai hybridize are picked, purified 

1 5 and grown for preparation of DN A. 

If a full-length sequence is not obtained by the cloning and screening 
methods described, full-length sequences may be obtained by screening additional 
clones, by RACE (Frohman and Martin, Proc. Natl Acad Sci USA 6*5:8998-9002, 
1988), ligation-anchored PCR (Troutt et al., Proc. Nail Acad Sci USA ^^9:9823-9825, 

20 1992) or other similar methods. Verification that a clone contains AD4 sequence may 
be obtained by determining the DNA sequence of the clone and comparing it to the 
sequence of AD4 provided herein. Sequence identity of 90% or better indicates that the 
clone encodes AEM. Some mismatch may be due to polymorphisms or DNA sequence 
errors. 

25 

EXAMPLE 6 
Detection of mutations 

Mutations in the AD4 gene may be determined by nucleotide sequence 
30 analysis. Peripheral blood cells are obtained from patients by venipuncture and 
hypotonic lysis of erythrocytes. DNA or mRNA is isolated from these cells and the 
AD4 gene isolated by amplification. Oligonucleotide primers are chosen based on the 
cDNA sequence provided herein or genomic sequence. A computer program such as 
Primer 2.2, which is publically available from the web-server at the Whitehead/MIT 
35 Center for Genome Research, assists in choosing oligonucleotide primers. If mRNA is 
used as a source of nucleic acids, first strand cDNA if synthesized by standard methods. 
Amplification of DNA is performed and the DNA sequence of the PCR product is 
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determined. A comparison of the sequence obtained from the patient and from a 
normal control allows identification of mutations. 

In addition, the NHU mutation in AD4 gene creates a &2/3A I 
restriction site, which may be used to rapidly screen for mutant alleles. Genomic DNA 
5 or cDNA may be directly screened for the Sau^A I restriction site or amplified by PGR 
using primers which flank the 141 codon. DNA or amplified DNA is restricted with 
Sau3A I and electrophoresed in an appropriate percentage agarose gel. The presence of 
the Sat43A I site is detected by ethidium bromide fluorescence for amplified products or 
by probe hybridization following transfer of the gel to nylon membrane. Suitable 

10 probes are oligonucleotides or DNA fragments containing sequence near or 
encompassing the 141 codon. Preferably, in the case of an oligonucleotide probe, there 
is at least 24 bases that do not span codon 141 . 

As shown in Figure 12, the N14II mutation is detectable after 
amplification. Approximately 60-100 ng of genomic DNA was amplified with primers 

15 WWF and INTIR (intronic primer 5*-GTGGGGCAGACGGAGAGAA-3') at 40 ng 
each, L5mM MgCKand 200 ^MdN ^Ps. Amplification was performed with an initial 
incubation at 95° C for 3 min. followed by 35 cycles of 45 sec. at 94° C, 45 sec. at 
60** C, and 45 sec. at 72° C, with a final incubation for 1 0 min. at 72**. The 1 60 bp PGR 
product (in 20 ul) was digested with 10 U Sau3A and the fragments resolved by 

20 electrophoresis in a non-denaturing 12% poly aery lamidc gel. Fragments were 
visualized by ethidium bromide staining. The N14II mutation results in fragments of 
102 bp, 58 bp, 43 bp and 15 bp (not shown) while non-carriers have only the 102 and 
58 bp fragments. Lanes arc: 1, al-risk, 79 years; 2, affected, onset 60 years; 3, at-risk, 
67 years of age; 4, affected, onset 75 years (presumed phenocopy); 5, Boehringer 

25 Mannheim DNA size standard V; 6, affected, onset at 52 years; 7, affected, onset 46 
years; 8, affected, onset 49 years. 

EXAMPLE 7 

Determination of the DNA Sequence of the AD4 Kenomic ref>ion 

30 

A 2.5 kb fragment of DNA from the AD4 gene was amplified from YAC 
92 Id 12 DNA using primers WWF and WWR and radioactively labeled by nick- 
translation (Sambrook, et al. supra). A human genomic clone hybridizing to this 
fragment was identified in a commercially available library (Genome Systems PI 
35 library). The hybridization was carried out according to the manufacturers instructions. 
PGR amplification using primers specific for the 5' and 3' ends of the AD4 cDNA 
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confirmed that the PI clone (clone #6004 in the nomenclature of Genome Systems) 
carried the entire AIM gene. 

DNA from PI clone 6004 was sheared and subcloned into M!3mpl8 
vector prepared by digestion with Smal and dephosphorylation. The vector was 
5 purchased from Novagen (Catalog # 69996-1 ) and the fragments were cloned according 
to the manufacturers instructions. Approximately 1600 clones were isolated. The 
sequence of the insert was determined by fluorescent detection on an ABI373A 
according to the manufacturers instructions. The data was assembled using the 
program Xgap (Sladen, R- (1994) The Staden package. In **MeUiods in molecular 
10 biology" A.M. Griffin and H.G. Griffin, Eds.» Vol 25, pp. 9-170, Humana Press). 
Exons were identified by entering the cDNA sequence into the assembly. 

The sequences of the 12 exons and the flanking iniron sequence of the 
AD4 gene are shown in Figures 13 to 19. The relative positions of the exons in these 
sequences are listed in Table 4. 

15 

Tabic 4 

Position of exons in the genomic sequence 





Exon 


Genomic 


Exon 


Exon 


20 




seauence 


position 


length 




exonl 


stm.genA 


546 


143 




exon2 


stm.genB 


850 


186 




exon3 


stm.genC 


327 


161 




exon4 


stm.genC 


2140 


215 


25 


exon5 


stm.genD 


168 


142 




exon6 


stm.genE 


808 


68 




exon? 


sim.genE 


1555 


221 




exonS 


stm.genF 


102 


99 




exon9 


stm.genF 


1345 


84 


30 


exon 10 


Stm.genF 


1809 


102 




exonl 1 


stm.genG 


1053 


117 




exon 12 


stm.genG 


2470 


683 



8^fSIXX:lO: <W0 9703192A3_IA> 




PCT/US96/n386 



EXAMPLE 8 

Isolation of the ecne fro m patient DNA and detection of polvmorphtsms. 

Treatment of Alzheimer's disease is more effective if begun prior to the 
5 onset of neural degeneration. However, current diagnostics only effectively identify 
patients after substantial loss of cognitive function. Basically, loss of cognitive 
function in an older patient without any detectable cause is the primary indicator of 
Alzheimer's disease. Strict confinmalion of the diagnosis is made by autopsy. Any 
treatment which blocks the initiation or progression of Alzheimer^s disease will be 

1 0 much more effective if an earlier diagnosis is made. 

One dominant mutation of the AD4 gene is known to cause early-onsei 
Alzheimer's disease. Other mutations in this gene or in other members of the same 
gene family may cause or contribute to Alzheimer's disease. We provide herein 
methods for detection of mutations in the AD4 gene. The same techniques will allow 

1 5 detection of mutations in other members of the gene family. 

Ten pairs of oligonucleotide primers used to PCR amplify ten fragments 
carrying the 12 AD4 exons are listed in Table 5. DNA is isolated from patient blood 
samples and individual PCR reactions carried out with each set of primers using the 
Perkin-Elmcr PCR kit according to the manufacturers instructions in a Perkin-Elmer 

20 9600 PCR machine with the following cycle conditions: 2 minutes at 92°C, 35 cycles 
of 20 seconds ai 92°C, 45 seconds at the indicated annealing temperature, and 3 
minutes at 72°C, followed by 7 minutes at 72°C. PCR fragments are purified by gel 
filtration in a 96-well fomial spin column with Sephracyl-500HR resin (Wang, Gan 
Boysen and Hood, AnaL Biochem. 226: 85-90, 1995). Mutations are detected by 

25 fluorescent dye terminator sequencing on the ABI373A using the sequencing primers 
specific for each exon as listed in Table 6. Many other mutation detection formats are 
possible using the PCR amplified fragments. 

A diagnostic test vnW have several uses in addition to the eventual use of 
targeting patients for early intervention vnih Alzheimer's drugs. First, clinical trials oJ* 

30 any Alzheimer's drug would benefit from monitoring the allelic variations in genes 
which predispose towards the disease. The genetic background of the patients may be 
correlated with different mechanisms for initiation of the disease: a drug which is 
effective against one disease mechanism may not be effective against disease caused by 
other mechainsms. Second, the diagnostics may be used to choose the appropriate 

35 treatments if clinical trials show that different genetic backgrounds are correlated with 
the efficacy of drugs. 
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Tables 

Primers for ?CR of cxon fragments 



Exons 



6,7 



9.10 



I2,31JTR 



Piimer Sequence (5-3*) 

INTI05L GGCGTTTTGnCTTCTTCTCTC 
ACCAAAGffiTACAGAGTTTTCCC 

n^lOSL CATCCCCAAGTGGTATGTGnA 

n^lOTR aCCCCAGCCGTGCTTGTG 

INTIIIL CGTTTGAnGACAGGCATaCT 

INTI I2R CAAGGTGKAGAGTGGACAG 

INTl 15L CATAT6CCCTAGTAGCTCATAG 

16R CCATTATACGAACAAGGAAGCTG 

INTl I3L GCTCATGGG6ATGGTGCCC 

nnt I4R GAAGCCGaGTCTAAAAGCnAT 

IKri02L CCCTCCCAGCGTAGGCATGAA 

XtnmK GCAGAAAGGGATGCAAGCCC 

INT103L GGCATGaCTGAGAfiCTCCA 

nnWR GCCCAGAAGCTCCnCCTCC 

immi TGOACCCCTCCCACAACGG 

INTI04R TCACAGAGATGCCTCTGATGG 

nnt HL ACTGGTmrACCTGAGrrrCAGA 

INTIISR C6AGCACACCTCCTGT6AG 

imTL GGTGTCTAGCACCGnATa 

DMOI009 ACAGGAaCATCTAnTATGGATA 



PCR product PCR conditions Anncaliog 
size** Mga2 (mM) tcmpcratorc C<^ 



956 



610 



549 



596 



1149 



1200 



572 



S66 



568 



803 



1.0 



i.O 



1.0 



0.75 



2.0 



1.0 



1.0 



1.0 



2.0 



60 



60 



61 



56 



60 



62 



60 



60 



58 



60 
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Table 6 

Primers for sequencing exons from PCR templates 
Primer sequence Primer Position 



Exon 1 


AACACCGTTCATACCTT 


808 


Exon 2 


CAATTGTCTGGGTATCA 


800 


Exon 3 


CTTTGTCTCACAGGAAA 


266 


Exon 4 


CGTGCATTACATGGATA 


2062 


Exon 5 


GTCCAGAATCACTCAAG 


79 


Exon 6 


ATCAGTCTCAGGATCCT 


719 


Exon 7 


GCTGACTGTCTGGAACT 


1584 


Exon 8 


CCAGTCAACTCTGAAAG 


288 


Exon 9 


CTTGCTCAATCTTCAAG 


1582 


Exon 10 


CCTGGTAACACTCTGAC 


1708 


Exon 1 1 


TCCACATCTTAGCTTCTAG 


912 


Exon 12, 5* 


CTAGACCATGACTCACAG 


2397 


Exon 12,3' 


AGAGGGGTCAACATTTAC 


3179 



20 



EXAMPLE 9 
Expression of STM2 

The expression pattern of STM2 was examined in multiple tissues by 
Northern blot analysis (Fig. 20). A probe was generated by amplifying the STM2 
cDNA clone using primers ADC22R2 and ADC22L5. Identical patterns were obtained 
with probes made using primer pairs DM0 1 01 8 and DM0 102 1 or DM0 1009 and 
DMO1014. A 2.4 kb message corresponding to STM2 was observed in all tissues 

30 examined, including all regions of brain sampled. (In kidney, liver and lung, longer 
exposures of the same Northern blot showed STM2 to be present at low levels in these 
tissues; data not shown). High expression levels were observed in the pancreas, heart, 
and skeletal muscle. In these three tissues, a second 2.8 kb transcript was observed. 
From the STM2 cDNA sequence, two potential polyadenylation signals were identified 

35 at nucleotides 1834 and 2309 (total clone length, 2167 bp). To determine whether the 
2.4 and 2.8 kb transcripts could represent different length 3* UTR polyadenylation 
variants, RNA biols were hybridized with a probe scanning the entire 3* UTR and with 
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a probe starting after the first poly adenylat ion site. Both probes gave identical patterns, 
indicating that only the second polyadenylation site was used. 

In an attempt to identify homologues of STM2. liver and pancreas 
cDNA v^as amplified with degenerate primers based on the STM2 sequence. Twcnty- 
5 two Ml 3 recombinant clones were identified and sequenced, all corresponding to 
STM2. Seven of the 22 clones had the sequence GAC5ATGGAAGAC beginning at 
nucleotide 1327 in the cDNA sequence, which is 3 bp shorter than the previously 
reported sequence {AGATGGjAAGAAGAC, where "|" indicates the junction between 
exons 9 and 10). The shorter transcript encodes a predicted protein one amino acid 

10 shorter (EMED) versus EMEED) writh no frameshift. The shorter transcript is likely a 
result of alternative splicing of exon 10, with the first coding AG dinucleotide used as 
the splice acceptor sequence or a polymorphism. To distinguish between these 
possibilities, RNA and DNA were prepared from a lymphoblastoid cell line from a 
normal (non-AD) VG subject in the HD family. Both transcripts were present in RT- 

15 PGR products using primers EXI03L and EX102R from exons 9 and 10. However, 
when the intron 9 to exon 10 region of genomic DNA from the same subject was 
sequenced, a single sequence that corresponds to the genomic sequence was obtained. 
Thus the two different transcripts arise from alternative splicing. Both transcripts were 
also observed in leukocytes, skeletal muscle, and one fetal brain librar>' (Clontech). 

20 with the longer transcript being more abundant. However, in another fetal brain library 
(Slratagene), only the longer transcript was obscr\'cd, while in a foreskin fibroblast 
library, only the shorter transcript was found. 

Table 7 

25 Primers for RNA analysis 

Primer Location Primers (5*-3*) PGR product Conditions Annealing 

sizc{kb) MgCJ2 Icmpcralurc (°C) 



I 


EXI03L 


Exon 9 


CTGCCATGGT6TGGACGGTT 


170/173 


1.5 


60 




EX102R 


Exon 10 


CTCCA6CTCCTCCCCTGGG 








2 


ADC22R2 


3*lJrTR 


GGACAAAGCATTGG6AACACT 


408 


1.5 


56 




ADC22L5 


3'UTR 


TGAGGGACATGGTGTGCCACAGG 








3 


DMOI0I4 


3mR 


6AACTGA6AAGGTCAGATTA 


26« 


1.5 


56 




DM01 009 


3'UTR 


ACAGGACTCATCTAniATCCATA 








4 


DM01018 


3'trrR 


ATTGGATGCAGnGTATAGT 


229 


1.5 


56 




DM01 021 


3*UTR 


TAATCTGACCnCTCAGnC 
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From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
5 invention. Accordingly, the invention is not limited except as by the appended claims. 
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Claims 

We claim: 

1 . An isolated nucleic acid molecule encoding an AD4 gene product. 



2. The isolated nucleic acid molecule of claim 1 wherein said molecule is 
set forth in Figure I . 

3. The isolated nucleic acid molecule of claim I wherein said nucleic 
acid molecule encodes a mutant AD4 gene product that increases the probability of 
Alzheimer's Disease. 



4. The isolated nucleic acid molecule of claim 3 wherein said nucleic 
acid molecule encodes a mutant AD4 gene product with an amino acid subsituiion at residue 
141. 

5. The isolated nucleic acid molecule of claim 4 wherein said substitution 
at residue 141 is an aspargine to isoleucine substitution. 

6. An isolated nucleic acid molecule selected from the group consisting 

of 

(a) an isolated nucleic acid molecule as set forth in Figure I or 
complementary sequence thereof; 

(b) an isolated nucleic acid molecule that specifically hybridizes to 
tlte nucleic acid molecule of (a) under conditions of high stringency; and 

(c) an isolated nucleic acid that encodes an AD4 gene product. 

7. An expression vector, comprising a promoter operably linked to a 
nucleic acid molecule according to any one of claims I to 6. 

8. The expression vector according to claim 7 wherein said promoter is 
selected from the group consisting of CMV l-E promoter, SV40 early promoter and MuLV 
LTR. 

9. The expression vector according to claim 7 wherein said promoter is a 
tissue-specific promoter. 



BNSDOCID: <WO 9709192A3_IA> 



wo 97/03192 PCT/US96/1 1386 

58 



10. A viral vector capable of directing the expression of a nucleic acid 
molecule according to any one of claims 1 to 6. 



II- The virai vector according to claim 10 wherein said vector is selected 
from the group consisting of herpes simplex viral vectors, adenoviral vectors, adenovirus- 
associated viral vectors and retroviral vectors. 

1 2. A host cell carrying a vector according to any one of claims 7 to 1 K 

13. The host cell according to claim 12 u^herein said cell is selected from 
the group consisting of human cell, dog celK monkey cell, rat cell and mouse ccIL 

1 4. An isolated protein comprising an AD4 gene product. 

15. The isolated protein of claim 14 wherein said protein consists 
essentially of an AD4 gene product. 

16. The isolated protein of claim 14 wherein said protein has an amino 
acid sequence set forth in Figure 2. 

17. The isolated protein of claim 14 wherein said protein is a mutant AD4 
gene product that increases the probability of Alzheimer's Disease. 

18. The isolated protein of claim 17 wherein said mutant AD4 gene 
product has an amino acid subsitution at residue 141 . 



19. The isolated protein of claim 17 wherein said substitution at residue 
141 is an aspargine to isoleucine substitution. 

20. A method of treating or preventing Alzheimer's Disease, comprising 
administering to a patient a vector containing or expressing a nucleic acid molecule 
according to any one of claims 1, 2, or 6, thereby reducing the likelihood or delaying the 
onset of Alzheimer's Disease in said patient. 

21. A method of treating or preventing Alzheimer's Disease, comprising 
administering to a patient a protein according to any one of claims 14 to 23, thereby reducing 
the likelihood or delaying the onset of Alzheimer's Disease in said patient. 
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22. A method of treating or preventing Alzheimer's Disease, comprising 
administering to a patient an antibody specific for a protein according lo any one of claims 17 
to 19, thereby reducing the likelihood or delaying the onset of Alzheimer's Disease in said 
patient. 

23. The method according to any one of claims 20-22 wherein said method 
comprises in vivo administration. 

24- A pharmaceutical composition comprising a vector comprising the 
nucleic acid molecule according to any one of claims 1 and 2, along with a pharmaceutically 
acceptable carrier or diluent. 

25. A pharmaceutical composition comprising a host cell according lo any 
one of claims 12 and 1 3, along with a pharmaceutically acceptable carrier or diluent. 

26. A pharmaceutical composition comprising a protein according to any 
one of claims 1 4 to 16, along with a pharmaceutically acceptable carrier or diluent. 

27. An antibody which specifically binds to the protein according lo any 
one of claims 14 to 19. 

28. The antibody according to claim 27 wherein the antibody is a 
monoclonal antibody. 

29. The antibody according lo claim 28 wherein the antibody is selected 
from the group consi sling of an Fab fragment, an Fv fragment and a single chain antibody. 

30. A hybridoma capable of producing the antibody according lo claim 28. 

31. A pharmaceutical composition comprising an antibody according to 
claim 27, along with a pharmaceutically acceptable carrier or diluent. 

32. A nucleic acid probe which is capable of specifically hybridizing to an 
AD4 gene under conditions of high stringency. 
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33. A nucleic acid probe comprising at least a portion of the nucleotide 
sequence shown in Figure 1 or its complementary sequence, said probe being capable of 
specifically hybridizing to a mutant AD4 gene under conditions of high stringency. 

34. A probe comprising at least a portion of the nucleotide sequence 
shown in Figure 1 or its compiementar>^ sequence, said probe being capable of specifically 
hybridizing to a mutant ADA gene, in which amino acid residue 141 is changed from 
asparagine to isoleucine, under conditions of very high stringency. 

35. The probe according to any one of claims 32 to 34 wherein said probe 
comprises at least 12 nucleotide bases. 



36. A pair of primers capable of specifically amplifying all or a portion of 
a nucleic acid molecule according to any one claims 1 to 6. 

37. The primer according to claim 36 wherein said primer comprises at 
least 12 nucleotide bases. 



38. A method of diagnosing a patient having an increased likelihood of 
contracting Alzheimer's Disease, comprising: 

(a) obtaining from a patient a biological sample containing nucleic acid; 

(b) incubating the nucleic acid with a probe which is capable of 
specifically hybridizing to a mutant AD4 gene under conditions and for lime sufficient lo ' 
allow hybridization to occur; and 

(c) detecting the presence of hybridized probe, and thereby determining 
that said patient has an increased likelihood of contracting Alzheimer's Disease. 

39. A method of diagnosing a patient having an increased likelihood of 
contracting Alzheimer's Disease, comprising: 

(a) obtaining from a patient a biological sample containing nucleic acid; 

(b) amplifying a selected nucleic acid sequence associated with a mutant 

AD4 gene; and 

(c) detecting the presence of an amplified nucleic acid sequence, and 
thereby determining that said patient has an increased likelihood of contracting Alzheimer's 
Disease. 



BNsocxar 



97031 92A3.IA> 



wo 97/03192 



61 



PCT/US96/11386 



40. A melhod of diagnosing a patient having an increased likelihood of 
contracting Aizheinner's Disease, comprising: 

(a) contacting a biological sample obtained from a patient with an 
antibody that specifically binds to a mutant AD4 protein, under conditions and for a lime 
sufficient to allow binding of the antibody to the protein; and 

(b) detecting the presence of the bound antibody. 

41. A peptide vaccine comprising a portion of a mutant AD4 gene product 
containing a mutation, in combination with a phannaceulically acceptable carrier or diluent. 

42. A transgenic animal whose germ cells and somatic cells contain an 
AD4 gene which is opcrably linked to a promoter eflective for the expression of said gene, 
said gene being introduced into said animal, or an ancestor of said animal, at an embryonic 
stage. 

43. The transgenic animal according to claim 42 wherein the animal is 
selected from the group consisting of a mouse, a rat and a dog. 

44. The transgenic animal according to claim 42 wherein AD4 is 
expressed from a vector according to any one of claims 7 to 11. 

45. The transgenic animal according to claim 42 wherein the AD4 gene 
encodes a mutant AD4 gene product. 

46. An isolated DNA molecule comprising AD4 gene sequence presented 
in Figures 13 to 19. 
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FIGURE 1A 

The coding region for AD4 begins a scarr codon beginning at 
nucleotide 368 gct(acg)ci:c and ends with a scop codon beginning 
at nucleotide 1712 atc(tga)ggg 

1 cgagcggcgg cggagcaggc atttccagca gtgaggagac agccagaagc 
51 aagcttttgg agctgaagga acctgagaca gaagctagtc ccccctctga 
101 attttactga tgaagaaact gaggccacag agctaaagtg acttttccca 
151 aggtcgccca gcgaggacgt gggacttctc agacgtcagg agagtgatgt 
201 gagggagctg tgtgaccata gaaagtgacg tgttaaaaac cagcgccgcc 
251 ccctttgaaa gccagggagc atcattcatt tagcctgctg agaagaagaa 
301 accaagtgtc cgggattcag acctctctgc ggccccaagt gttcgtggtg 
351 cttccagagg cagggctacg ctcacattca tggcctctga cagcgaggaa 
401 gaagtgtgr.g atgagcggac gtccctaatg tcggccgaga gccccacgcc 
45 1 gcgctcccgc caggagggca ggcagggccc agaggatgga gagaacactg 
501 -.ccagtggag aagccaggag aacgaggagg acggtgagga ggaccctgac 
551 cgctatgtct gtagtggggt tcccgggcgg ccgccaggcc tggaggaaga 
601 gctgaccctc aaatacggag cgaagcacgc gatcatgctg ttcgcgcctig 
651 tcactctgtg catgaccgtg gtggtagcca ccaccaagtc tgtgcgcttc 
701 tacacagaga agaatggaca gctcatctac acgccattca ctgaggacac 
751 accctcggtg ggccagcgcc tcctcaactc cgtgctgaac accctcacca 
801 tgatcagcgt caccgtggtc atgaccatct tctcggcggt gctctacauig 
8S1 taccgctgcc acaagttcac ccatggccgg ttgatcacgt cttcactgac 
901 gctgctgttc ctcttcacct atatctacct tggggaagcg ctcaagacct 
951 acaatgtggc catggactac cccaccctct tgctgactgt ctggaacttc 
1001 ggggcagcgg gcatggtgtg catccactgg aagggccctc tggtgctgca 
1051 gcaggcctac ctcaccatga tcagtgcgct catggcccta gtgttcatca 
1101 agtacctccc agagtggtcc gcgtgggcca tcctgggcgc catctctgtg 
1151 tacgatctcg tggctgtgcc gcgtcccaaa gggcctctga gaatgctggt 
1201 agaaactgcc caggagagaa atgagcccat attccctgcc ctgatatact 
1251 catccgccat ggtgtggacg gttggcatgg c^aagctgga cccctcctct 
1301 cagggtgccc tccagctccc ctacgacccg gagatggaag aagactccta 
1351 cgacagcttt ggggagcctc cataccccga agtctccgag cctcccttga 
1401 crggctaccc aggggaggag ctggaggaag aggaggaaag gggcgcgaag 
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FIGURE IB 



X451 cctggcctcg gggactccat cctccacgt gtgctggtgg gcaaggcggc 

X501 tgccacgggc agcggggacc ggaataccac gccggcctgc ttcgcggcca 

1551 tcctcaccgg cccgtgcctg accctcccgc cgcccgctgt gtccaagaag 

1601 gcgccgcccg ccccccccac etccatcacg cccgggctca tctctcacct 

1651 ccccacggac aacctggcgc ggcegtccat ggacaccctg gccccccacc 

1701 agccccacat ccgagggaca Cggcgtgcca caggccgcaa gctgcaggga 

17S1 atctccattg gacgcagttg tatagtctca cactctagtg ecaeatatcc 

1801 ttaagactcc tctttcctca aaaaataaag taegtgttca ctcggtgagg 

1851 aggaggcaga accagcsccc tggtgccagc tgcttcatca ccagacttcg 

1901 gctcccgctc tggggagcgc ctcgctscac ggacaggaag cacagcaggt 

1951 ecatccagac gaactgagaa ggccagacta gggcggggag aagagcatcc 

2001 ggcacgaggg ctgagacgcg eaaagagcgc gctcgggagc ggcccccggc 

2051 acccgggtgc cctggctgga gaggaaaagc cagtccccca cgaggagcgt 

2101 tcccaacgcc tcgtccacga tgtcctcgct actctaccgc ccceagaaac 

2151 tgagrcccgc Ccccgttacg gcagtcacac tgccgggaag Cggcccaaca 

2201 gcaataecaa taaatagaCg agtcccgcca gaaaaa 



t 
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FIGURE 2 



Procein se<;uence of 44B amino acidc t-h^^ 

of Che Hunwm AD4 gene. «P^«s«»ts the gene prcduc: 



1 '^•f^^SE EEVCDERTSr. MSAESPTPRS CQEGRQGPED CEOTAQWHSQ 

51 E^^SI«EEDP DRYVCSGVK; RPPGLEEELT LKVOAlraviM tJ^LCMI 

101 WVATIKS-^ FYTEmMLI YTPSTEDTPS VGQHI^SVL HTLIMISVIV 

ISl VMTIFLVVLy KVRCTOIHG WLDlSSLfflX FLrrmL^ 

201 yPTLLLTVWN FGAVGMVCIH WRGPLVLOQA YLIMlSAUa LVFIICYLPEW 

251 VTOLVAVLCP KGPUWLVET AQEHNEPIFP ALIYSSAMVW 

301 TVGHAKLDPS SQGA^JLPTO PEHEEDSYDS FGEPSYPEVF EPPLTOVPGE 

351 n.EEEEEI«W KMUariKY SVLVGKAAAT <»GIWN^ 

401 LTLLLLAVFK KALPALPISI TTOLIFYFST nNLVRPFMDT LASHQLYl 
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FIGURE 3 



sequence aligianenc of the Chromosome X AD4 gene produce with che 
Chromosome 14 AD3 gene product. 

AD4 1 MLTFMASDSEEEVCDERTSLMSAESPTPRSCOECTQ^EIWEIITAC^ SO 

I -= I- I = "I I III .:.|: -I - -II 

AD3 1 MTELPAPLSYTQ. .HAQMSEDNHLSNTVSSQOTNRERQEaND. . . .RHSL 44 

AD4 SI ENEEDCETOPDRYVCSGVPGRPPGI^tTUre^^ 100 

AIM 101 Y^-;^TIlCSVRFYTEKIIGaLryrPFTEOT X50 

.e "ll|l!l|.||| hllllllllllll-.IIII |:|:|l. Ilinil 

AIM ISl yS^yiSXyVLYSTOCTK^IKG^ 200 

i 1 1 I H i I 11 . I : I I I : I I I : I I I : I . : I I I I I I . 1 1 I I I I . I 

AD4 201 yPT^TVWNFGAVGJWCIHWKGPLVLQOAYLIMISA^^ 2=0 

AD4 251 SAwvxuiXLs\ra:>Lyni}r[^ 3oq 

'''''l'-IHIIif||t|||||||||||||||||l.:|]|||{|l If I 

AD4 301 TVCaiAKLDPSSQ. . GALQLPYDPEMEE . . . . DSYDSFGEPSYPEVFEPPL 344 

AD3 295 LVNMAEGDPEAQRRVSKNSKTNAESTERESQDTVAZNDDGXSFSEEM^ 344 

AD4 345 TGYPG £eL EEEEERGVKLGIjGDFIFYSVLVG 375 

AD3 345 DSHMPHRSTPESRAAVQiLsstlAciDpliii^ 394 

AD4 37e KAAATGSGrM^^TtACF^^ 425 

AD3 395 KASATASGDWTOTlACFVAILlGLCLTr^IAIFKKALPALPISITFGLV 444 

AD4 426 FYFSTDNLVRPFMDTIASHQLYI 446 

lll-ll IMIII II IhN 

AD3 445 FYFATDYLVQPFMDQLAPHQFYI 467 
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FIGURE 5 



Sequence features of the Human AD4 



gene product 



1 HLTFMASDSE EEVCDE31T5L JISAESPTPRS CQEGRQGPED GENTAQWRSQ 

51 ENEEDGEEDP DRYVCSGVPG RPPGLEEELT LKYGAKHVIM LFVPVTtCMI 

101 VWATIKSVR FYTEKNGQLI YTPFTEDTPS VGQRLLNSVL NTLIMISVIV 

151 VMTIIXWLY KYRCYKFIHG WLIMSSLMLL FLFTYIYLGE VLKTYNVAMD 

201 yPTLLLTVWN FGAVGMVCIH WKGPLVLQQA YLIMISALMA LVFIKYLPEW 



251 SAWVILGAIS VYDLVAVLCP KGPLRMLVET AQERNEPIFP ALIYSSAMVW 



301 TVGMAKLDPS SQGALQLPYD PEHEEDSYDS FGHPSYPEVF EPPLTGYPGE 
351 ELEEEEERGV KLGLGDFIFY SVLVGKAAAT GSGDWNTTLA CFVAILlGLC 

Putative TGlycosylat ion Site (N 386) 



401 LTLLLLAVFK KALPALPISI TFGLIFYFST DNLVRPFMDT LASHQLYI 



Conserved tetrtoer (KGPL 222-225 J 



Conserved tetramer (KGPL 271-274) 
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DOTPLOT COMPARISON OF THE PRODUCTS OF HUMAN AD4 AND AD3 GENES 



HUMAN AD4 GENE PRODUCT (l TO 448) 




Fig. 6 
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FIGURE 9 



D1S103 ACCSVAClTTCTACAAGTiac 
"TCaGACaAACTGacCTGT 

DlS2i3 CararCCAAGCTCAGGaGG 

AGCrGTXAATCCAATCXATGArGTG 

^3225 TGGCCTSAA=aGACXAI3lAAAA 
GCCTGGGTGACAAAGai 

D1S227 AGCTGTCrCGGGTCTTOA 

GaGcsarrrcAGaATGTrcc 

D1S229 GCTTCTTTCOirrXATGGTG 

ACrCTAGrTGTGTGTGAArGTATG 

D1S32C TTTTGGCACCAGTaATaTCTTG 
TTGCaGACTCTGAACrTAGAACA 

D1543 9 CACaGACTTGVTTJlGAGGGG 
GTTGaAATGGTGAATTTGGA 

d^3459 AGCTGGGGTGrrriATGGAG 
- GGGGACCCTGTTTCTGCrAC 

D1S479 ACCCATTGCCacCarC 
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FIGURE 10 

dna sequences 

788IIH5.SEQ 

1 ACTCAAATCG TCCTGTGACC ACGCGTOUU; TGCTGATGGG GCACAGCAAC 
SI rrCCGGGCCT ATCATATCTC CTTGACCTCG TCCCTCAAAT (rreGTAGTTT 

IS ^^^"^'^'^'^ '^'=^°=*« AAGTATGTTC AAAATCoS 

ISl TCTATAGGCA GGTCCTTCCA AAGTCCAATA GTCCAAGGTG GACGCTT 

EST T03796 

I GAGAAGGTAA GAGCTGACCC TCAAATACGG AGCGANACGT NATCATGCTC 
51 TTTGTGCCTG TCACTCTCTG CATGATCGTG GTGGTAGCCA CCATC^ 
101 TGTGCGCTTC TACACAGAGA AGAATGGACA GCTCATCTAC ACCCCArPCA 
ISl CTGACGACAC ACCCTCGGTG GGCCAGCCCC TCCTCAACTC CGTGCTGAAC 
201 ACCCTCATCA TGATCAGCGT CATC3TGGTT ATGACCATCT TCTTGGTGGT 
2S1 GCTCTACAAG TACCGCTGCT ACAAGTTCAT CCATGGCTGG TTGATCATGT 
301 CTTCACTGAT CCTGCTGTTC CTCTTCACCT ATATCTACCT TGGGGAAGTG 
3S1 CTCAAGACCT ACAATGTGGC CATGGACTAC CCCACCCTCT TGCTGACTGT 
401 CTGGGAACTT CGGGGCAGTG GGCATGGTGT GCATCCACTG AAGGGGCCTT 
4S1 TGGTGCTGCA GAGGCCTACC TATCA 

ELLl.l DNA sequence 



ggtcgngctctacaagtaccgctgctacaaggtcaggccctggcctgccctccagccaccctt™ 

^CCJ^TGGCGCCAGGGCCGTOAAAC«KCGCCTtTA^ 
i-XATTTA. ...... ^ X . xci '''***°^'«GTCTCACTCCATTGCCCAGGCTGGAGTGCAGTGOTGC 

ELL1.2 DNA Sequence 



TGCTGATATACTGTATGTTGTl 



^*!!!*!^^"'^"'"°^"°<^**'^^TGCCCGTGGGCAGAG 

CTTTCTA 



Sl^^a^r^ ^"^"""^^'^^^^^^^^^^^^G^^CCAGCATTAO 

GaGGATGACCTGAGGGGAACTCCTTGGACITCTGTGCCC^ 
CCAGCGTAGCCATGAAGTAGCCTAATGAAGACKrATTaVGGCTTtSKrrATCAGT^ 
TACAATTtGTGGCGCTTGGGGACACCTTGTGATCG«GN;U.TTTCNGTTGTCTASS^TS?^^ 
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FIGURE 11 



PCR PRIMERS 





GGA 


CAC 


ACC 


CTC GGT GGG CC 


WWR 


CAA 


GGT 


AGA 


TAT AGG TGA 


EPIF 


CTT 


GGT 


GGT 


GCT CTA CAA GTA C 


EFIR 


GTA 


GTC 


CAT 


CGC CAC ATT GT 


WVfFlL 


TCC 


TCA 


ACT 


CCG TGC TGA AC 


WWRIR 


A6C 


ATC 


AGT 


GAA GAC ATG ATC 


1.3F 


ACC 


ACC 


GGG 


GCA ACC TQA 


X.3R 


AGG 


AAG 


CAG 


ATC TCA ATA GGA 


EP2F 


GAG 


AAG 


GTA 


AGA GCT GAC CC 


EF2R 


TGA 


TAG 


GTA 


GGC CTC TGC AGC 


921D12F 


TCC 


AGC 


CAC 


CCT TCT CTC C 


92ID12R 


CAG 


TAT 


TAG 


GTC ATC CAC GGT 


INTIR 


GTG 


GGG 


CAG 


ACG GAG AGA A 


ADC22L1 


CCC 


AAG 


TGT 


TCG TGG TGC TT 


ADC22R1 


ACT 


GTG 


GAC 


ATA GCG GTC AGG 


KM7749 


GGT 


AGA 


TAT 


AGG TGA AGA GG 


KM7750 


TCA 


CTG 


AGG 


ACA CAC CCT CG 



9 
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FIG, 12 



BNSOOaO <WO 9703192A3JA> 
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FIGURE 13 

Genomic sequence of stm.genA 

ctgcgccctgggggccctgccctgccctccgcacgcctctggccacggtcccttccccgg 
ctgtgggtctgcggcccctgcgtgcgcagctcctggcctctgcggccagcgcgggggcTO 
agagaggagagtgcccggcaggcggcggctgggccggcccggaactggJtlgtlglLl^ 
tcgcggggagcggccctcaggccttcggctcactgcgtccccacttccrtglgllcOTit 
gccgccgagccccggctgggggtgggcgcggcgcgagcggttaaagggccggtgcattta 

gcatgctttccccgggaaggaggcgcgcgggggctctgcccgcacgtgaggggcagggll 
?^?^^r''*^''''^*^*'''^'^^^*°^9^'=^5cagcggtgtttggct:|tlllltclggca 
tttccagcagtgaggagacagccagaagcaagcttttggagctgaaggaacctgagacag 
aagctagtcccccctctgaattttactgatgaagaaactgaggccallgagctaalgtgl 
^^^"r'="^?*'=2ff=^99^^=9atatagcagagccaggcttcgaccccagtgtcc€g 
gcttctagatctgctgtccatccctccgagcagacctcacccctgtttattgccttaata 
agtattccctttgaaaggtatgaacggtgttgagtgaagtaactgcatccctatttacaa 
atggagaacctgagagcattccatagagacgattgtagactaacttaactcagaagcgac 
agcctggggttgccaaggctgtctacgaagtaacttgattaggaccgaccccagcttcca 
gtaaggaagcctctgatgcctctgtagccaattctgcagacacctgagcctccaaggcct 

ggggccl:ggttcttggccccacctcagagctgcgtcctcacgacgcccacg€?glgcltt 
f?25!^^''^^?ff?!^'"'^^'*5*'9*999=^''99gggcatgttgctttgaagtgggatgg 
atgtatcaggtttttggggaaaactctgtaccctttggtgttgaagtcccatgtlLa 



.97031 92A3_IA> 
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FIGURE 14 

Genomic sequence of stm.genB 



aaaataaaccctacagataca 
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FIGURE 15 



caccacagatgtttgctgattgaatgaatgagcacactgacagtttggagctgccctgac 

tttcgtggctatgcgttttgccccctgggatgtgagtcacctcaggccagccccaggcaa 

ggccgctgctgcctccatggtaactctcaaggcctcttgttttatggcagtcgtttgatt 

gacaggcatctcttggaagcttttggggcaggacttgtgtccaagtctccaggtcgcctc 

cagccaccccctgagtcctccactgcctttgtctcacaggaaagtggaacaaggtccttg 

tgctcctttttccaggtgcttccagaggcagggctatgctcacattcatggcctctgaca 

gcgaggaagaagtgtgtgatgagcggacgtccctaatgtcggccgagagccccacgccgc 

gctcctgccaggagggcaggcagggcccagaggatggagagaanactgcccagtgggtag 

gtcccaccagcagctgggggccttcaaacaggtcctgcggctactgtaccttacagatga 

aaaccagacattcattccctgatgcgggagggagaagggaagtaatgatgaggattggcc 

gaaaaggtgggtggctggccatgatggaccttccatctgcagggtttcataggactgcgc 

attcacagccagagatggacttggcagtgggctgaagacgctgtccactctgccaccttg 

ggtttacctctctcatgcaggtcactgtttccactgtaataggagagtttgtttggatgc 

ctgggtgctaggacaggtaacacagaagcttaggatggtagcaggggaagcattttttgg 

cagatggccagacatggtaagtgtgagaggagtctgcctgatacacgattgacttttgag 

ctggggatatttgggcttcactgtgatcattcagcccccaggggaggagattgtaacgtt 

agaaagagtaggatatcgttgggagagccactragttgtgtcctttctctcccgatcagg 

gcagaacatctgaatttgcctgaaccctgttctctgttttgcccattatagaattaaaaa 

atgtctctgtgtggactgtttttttgcagccagtcttaatcctgcttgctgaaatttgag 

ctcacttctccatgttctccttgagaacggaaccatcgtccctaagccctgagtgaaatc 

acaccagcttaaggccactgctctgccactcctcagccttttcttgtttgttatctccgg 

gaagttttgtacactttggttgtttcagtttctgttcatgagtagtcttctttcttggct 

gaacgtctagattgggactctctctgcagagaaccggtactgaagcaactgtcattttca 

gtttttgtttcatttggctttttctttagctgttcaccttattagcaaggcagcccatga 

ccttgacttgccacagttccaaaacacaaattcttacagatcggtttgtgctagtgtctg 

gcaggtgtcctgccctccctcgttacctcctcatttgtgcctgcccaccttcccagagcc 

tgcgtcttctcagatgcttaacacctgtttagcctctctagttcagagctacaaatttac 

atgcttgattctgtggggcagaaagttcaaagtaatttcttcctctgcaaattcccagta 

tcttagtcacacgcaaagagagtgtccctgtgcactgactcctctagctagtgatttgtc 

agccaaaaatgtttatttatctcctggcctgtctcctcccatatcagtatggccacatga 

acagaattgagtgacctcctgagtccctgtattaggaaggggaaagatcttttgattcat 

taaccattaagttgattcattaaccattaagtcttgggcctgcagaccatagcaaccttc 

cttccttcatttatggtgcttcatccagctccaaatcttctctactttgtcctcacaaac 

ttttcatatgccctagtagctcatagactgctccttatatctggaaagcaacattcaaac 

ttctcatttctggttccaaaaatccgtgcattacatggataggctgccntgggggacatt 

cngcggccctcacgatgtggtttcccacagagaagccaggagaacgaggaggacggtgag 

gaggaccctgaccgctatgtctgtagtggggttcccgggcggccgccaggcctggaggaa 

gagctgaccctcaaatacggagcgaagcatgtgatcatgctgtttgtgcctgtcactctg 

tgcatgatcgtggtggtagccaccatcaagtctgtgcgcttctacacagagaagaatgga 

cagctgtgagttggggggctggggggagcagggtggggtgagggctgagttgccaggggg 

tggggggcgcagcagcctgtgttggtcactgtacctgcagctccacaccagcagcggtaa 

agagcagggatgaagaaccgcccaggttcatggcctggctcactgcctcctggattgtga 

cctacttgggcatgcttttaacatccctatgcctcagcttccttgttcgtataatgggtt 

gataacgcagttactgggagaattaagtgagtta^tatgagtgaagggcttagaagagtg 

tctactgcacgtgagtgctcaggcaagctggatcctgctgcagaaagcaagctettgatc 

ctgggcatggctgtgccactgatccctgtgtgactgcaaacaaatcacttcctctctgag 

tctctgcttccctgaatgtgaaacaaggtggttggaccagatatttctcagctcacttcc 

agccttgtgaggaagacttataaagcctttcgtttattttagtaaaatacatgcagaggc 

agcagcgtagaaaaatgagaagcttcctccacttcttccccctcccctttctgtggtcct 

cactgctaagcaccttctgtaaacttttttttttttttttaaagttaggganttttgttt 

catttcgtgtgtgttggttttttttgttgttgttgtttcttttaaagaaaggaataaggc 

caggtgtggtgtctcatgcctgtaatcccagcactttgggagactgaggtgagaggatta 

tttgagcccaggagtttgagaccagcctgggaaatgtggcgagaccctgtctgtacaaaa 

aatgcaaaaattagccaggtgtggtggtacatgcctgtagtctcagctacttgggagact 

gaggtggaagaacacctgagcccagaagtcgaggctgcagtgagccatgattgcgccact 

gcactgcagcctcagcaacagagtgagaccctgcctcaaaatttttttaa 
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FIGURE 16 




.970319243. 1 A> 
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FIGURE 17 

Genomic sequence of stm.genE 



Oil S'^^^^^^^^^ 

^f^^!^^^"^^^^^cacaagtggacatgggcatgaggacc?gg|cg|||aaagat|alc 
f!f?!?f^?"?^"^==*="gtgccagccgttttgggaaccllQgcIOcatcccc«^ 




r 
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FIGURE 18 

Genomic sequence of sta.genF 




^^^^^^^ 

issllIiiliaip.iS 
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FIGURE 19 



Genomic sequence of stm.genG 



cagggtctcgctctgttgcccaggctggagtgcagtagcacgattgtggctcactgcaaccttgacttcctgggc 

tcaagtgatcctcccaccttggcctcctgagtagctgggattacaggcacgtgctaccacacctggctcatttat 

atttttagtagagacaaggttttgacttgtrgcccgggctggtttcgaactcctgggctcaagtgatccacctgc 

ettggccttccaaagtactgggattataggtctgagccacagcacctggccaaaattagttttattataagagat 

gcaactcaggaccagccaaatgaagagacagtgaagaagtaatgctgatggatcacacctggtgggggaaggcgg 

acagctggggccaggagcaggagggacacctgcagggctggaagggcaggggaggtgggcccccatggtctgtgt 

ttattgcataaccarttttattgtctacagtgagcaaagttatcctataaacaagtgtcagggaccattgcacta 

aagaaaacaaatgagagcattttggaagctctaatttcctgatcagtaatgggtagactaattcccagttatatt 

tacctgttgtaaggtgaaaggttcttcagaggacctctgtcttggtgttatatgggcttttgaatgtactgaaat 

taaattccctaaaaanctgtgattcagacttcatactaaattgtacagcagtgcccagcccaaggccttgcattt 

ctatttgttgttttctttactctctaagtgcccaacactggttttacctgagtttcagaactgcccgcttttctc 

tgcccaggttgtaagtcacccagtccacaggtgccccctgctctcccactggccactgatttggggaggcagctg 

tccatgtccccagtccacatcttagcttctagaggccaggtggggtgggctgggctgggcaagagcagctgggcc 

ttctgggccagagtttctcttctttttccattctgtgcacgcctcttcagtacgggttactgtctctcctcacac 

agggggcgtgaagcttggcctcggggacttcatcttctacagtgtgctggtgggcaaggcggctgccacgggcag 

cggggactggaataccacgctggcctgcttcgtggccatcctcattgtgagtggctggggatgcgtccaactgce 

tcgtggtgggggcccccagggtcctcattgtggtgggggcaggcctcaggatccctagggatttttcatttcttc 

tcttcccrctgagggacaagagcagggagcggggctggaagggtcagcttgagaccaaggctcacaggaggtgtg 

ctcgcccctaggtgggctccagcctgtggaggacagtgcaggggagggtgaggagtgtaccggccccagcgtggc 

tgagcacacagcctccaggccgaggacccagctgacagctttgcgcagtgatgataccctcgaggtggttgtgat 

gacatcagatttgcagaaaagaaaattgctcaagggccttgcccatgggcgcaaagctagtgaggaccatgtttt 

ccccctcctccatgccattgggacaccacagggtctgaatctggggcactaggggtggccccgttactgtgaacc 

acagcagtgaaatgtggaggccctgtagtcagttaacgtggccagatacacataatggggagacgtcctgccgtg 

acttcatctcagagatttcgctgtcacgttagaggaggaggagcgtctgagccgtgcgcttggcatctgcccctt 

agtgaaaaccctgggcatggcatgattaaggttgatgctccagtgtccagaaggttttctrttttgcccacaagta 

tatcagggatgggatggtggacccaggctcctccaccaccagactgccttacctgagccctgctggccccaaaga 

xatagaaggcaccctggttccctgtgctcacctggaccactgcctgcatcagctgggtcaggggaggatgggcag 

cccccacacctgcttcccaggggcaggttgcctggcggctctgattcccttggtgccagctgcugagaaccttac 

rgccatttcagttgagcccacctagctctcatataaatacatgttccctgagggcatcttaccatcccatgtgac 

cactccagccagacaggggaggcagcacggcctcggggcacagcactgctccaggagtcaggaggcctgccttct 

ggttcactcactaacaggtgaggtgacctaatgggggtgagaacttctgcccttaacacctcaagagctgtrtgca 

ggaccagggaagataatggggtgtctagcgccgttatccgactggtcctcgaacaagctcctgtgcccagggact 

agaccatgactcacagctcctgtccacaccagggatcaccacgctcaccctcccctccatgtcctgcagggcttg 

tgtetgaccctcctgctgcttgctgtgttcaagaaggcgctgcccgccctccccatctccatcacgttcgggctc 

atcttttacttctccacggacaacctggtgcggccgttcatggacaccctggcctcccatcagctctacatctga 

gggacatggtgtgccacaggctgcaagctgcagggaattttcattggatgcagttgtatagttttacactctagt 

gccatatatttttaagacttttctttccttaaaaaataaagtacgtgtttacttggtgaggaggaggcagaacca 

gctctttggtgccagctgtttcatcaccagactttggctcccgctttggggagcgcctcgcttcacggacaggaa 

gcacagcaggtttatccagatgaactgagaaggtcagattagggcggggagaagagcatccggcatgagggctga 

gatgcgcaaagagtgtgctcgggagtggcccctggcacctgggtgctctggctggagaggaaaagccagttccct 

acgaggagtgttcccaatgctttgtecatgatgtccttgttattttattgcctttagaaactgagtcctgttctt 

gttacggcagtcacactgctgggaagtggcttaatagtaatatcaataaatagatgagtcctgttagaatcttgg 

agrttggtccgttgtaaatgttgacccctctccctgcatcttgggcacccctgggataacttgtgctgtgagccc 

aggatggaggcagtttgccctgtttgaaggaacttttaatgatctcgcctctctgcacacatttctttaactaga 

aagttrcctaagcaaaggagttaggagagcagggtggcctgacatctgccagccctgagctgtasggctgtggat 

gctgagcaggtccctggactcagttgtgcacggtggcacagacactgccaggtggttgccaaaacatccagtggt 

tccttcagcaagtgttcaccctccgcagaagcctgtgagggcctgagctcagaaaccactctcctttccttctct 

ggctttggccctgggcactgtggtgggagagtggacagtttggctttgccttctctgtacatcaatcatgggttg 

caaagagaatctcagaagtgcctcttcctgagcacagtggctcacacctgtaateccaatacttcgggaggtcga 

gtcggaaggatcacttgagcccaggagtttgagaccagccggggcaacatagtgagactttgtacaaaaaaaaat 

ttaaaaattagccaagcatggtggcatgcatctgtagtctcagctactrctggaggctgaggtgggaggatcactt 

aagcccaggaggctgaggctgcaatgagccgagatcaagcaggtgttaggtatatcagacagctgagaagacgca 
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